Purified and isolated platelet calcium channel nucleic acids and polypeptides and therapeutic and screening methods using same

ABSTRACT

Isolated and purified platelet voltage dependent calcium channel (VDCC) α₁ subunit polypeptides, and nucleic acid molecules encoding the same. Recombinant host cells, recombinant nucleic acids and recombinant proteins are also disclosed, along with methods of producing each. Isolated and purified antibodies to platelet VDCC α₁ subunit polypeptides, and methods of producing the same, are also disclosed. Platelet VDCC α₁ subunit polypeptides have biological activity in calcium transport. Thus, therapeutic and diagnostic methods involving this activity are also disclosed.

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application is based on and claims priority to U.S. Provisional Patent Application Serial No. 60/258,169 filed Dec. 22, 2000, the entire contents of which are herein incorporated by reference.

GRANT STATEMENT

[0002] This work was supported by NIH grants I-P20-DE123474 and I-P60-DE 13079. Thus, the U.S. Government has certain rights in the invention.

FIELD OF THE INVENTION

[0003] The present invention relates generally to isolated and purified proteins and nucleic acids. More particularly, the present invention relates to isolated and purified platelet calcium channel polypeptides and isolated and purified nucleic acids encoding the same.

Table of Abbreviations

ASO        allele-specific oligonucleotide
A-T        ataxia-telangiectasia
BSA        bovine serum albumin
CDR        complementarity determining region
ECL        electrochemiluminescence
EST        expressed sequence tag
fl         full length
HAT        cell culture media comprising hypoxanthine, aminopterin, and thymidine
IMAGE      integrated molecular analysis of genomes and their expression
kDa        kilodalton(s)
KLH        keyhole limpet hemocyanin
L          liter(s)
LAT        ligation activated translation
LCR        ligase chain reaction
MAb        monoclonal antibody
MAb 1A     Monoclonal Antibody 1A
mL         milliliter(s)
NASDA™     nucleic acid sequence-based amplification
nm         nanometer(s)
nt         nucleotide(s)
ORF        open reading frame
PBS/BSA    phosphate buffered saline/bovine serum albumin
PCR        polymerase chain reaction
RACE       rapid amplification of conserved ends
r.t.       room temperature
RT-PCR     reverse transcriptase polymerase chain reaction
SSCP       single strand conformation polymorphism
SDA        strand displacement activation
SNP        single nucleotide polymorphism
wt         wild type
SDS-PAGE   sodium dodecyl sulfate polyacrylamide gel electrophoresis
VDCC       voltage dependent calcium channel
μg         microgram(s)

BACKGROUND OF THE INVENTION

[0004] Calcium influx into the platelet controls important processes during platelet activation. Sage, S. O. Exp Physiol. (1997) 82:807-823; Mills, D. C. Thromb Haemost (1996) 76:835-856. The pathway for this calcium entry is not well understood. Recently, gating of an ADP receptor (P₂ₓ) has been suggested to be responsible for rapid calcium (Ca²⁺) influx into the platelet. Mackenzie, A. B., et al., J Biol Chem. (1996) 271:2879-2881; Sun B., et al., J Biol Chem. (1998) 273:11544-11547. Indeed, several ions, including Ca²⁺, Mg²⁺ and Na⁺, enter when this receptor is activated. However, calcium entry via this receptor is believed not to be sufficient for the processes that take place during platelet activation. Jin, J. and Kunapuli S., Proc Natl Acad Sci U S A. (1998) 95:8070-8074; Kunapuli, S., Trends Pharmacol Sci. (1998) 19:391-394.

[0005] P₂ₓ is an ionotropic channel, and its gating in other cell types, such as skeletal muscle cells, has an excitatory depolarizing effect that activates voltage dependent calcium channels (VDCCs) (Bean, B. P., Trends Phys Sci. (1993) 13:87-90; Surprenant A., Trends Neurosci. (1995) 18:224-229; Abbracchio, M. P., et al., Pharmacol Ther. (1994) 64:445-475). It is the activation of these calcium selective VDCCs that triggers calcium dependent events inside these cells, such as secretion and contraction. Ashcroft, F. M., Ion channels and disease. San Diego, Calif.: Academic Press; (2000); Boyd, A. E., J Cell Biochem. (1992) 48:234-261; Hille B., Ionic channels of excitable membranes. Sunderland, Mass.: Sinauer Associate, Inc. publishers; (1992); Armstrong, C. M. and Hille, B., Neuron (1998) 20:371-380; Berridge, M. J., J Physiol (Lond). (1997) 499.2:291-306.

[0006] The presence of VDCCs in platelets is controversial (Sage, S. O., Exp Physiol. (1997) 82:807-823), and heretofore their expression has not been investigated at the molecular level. Thus, in view of the role of calcium influx in controlling processes during platelet activation, the identification of a VDCC polypeptide in platelets represents a long-felt and continuing need in the art.

SUMMARY OF THE INVENTION

[0007] The present invention discloses an isolated and purified polynucleotide encoding a platelet voltage dependent calcium channel (VDCC) α₁ subunit polypeptide, an isolated and purified platelet VDCC α₁ subunit polypeptide, and a characterization of the role played by a platelet VDCC α₁ subunit polypeptide in modulating calcium transport during platelet activation. Optionally, a polypeptide of the invention is a recombinant polypeptide. The platelet VDCC α₁ subunit polypeptide can comprise a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide. Preferably, a polypeptide of the present invention comprises a nucleotide or amino acid sequence selected from the sequences of any of SEQ ID NOs:1-8, 28, and 29.

[0008] The present invention also provides an isolated and purified polynucleotide that encodes a platelet VDCC α₁ subunit polypeptide that modulates the levels of calcium in platelets as well as biological activities affected thereby. Optionally, a polynucleotide of the present invention comprises a DNA molecule from a mammal, including a pig and a human. Preferably, a polynucleotide of the present invention encodes a polypeptide comprising an amino acid sequence of SEQ ID NO:2 or 4. Most preferably, an isolated and purified polynucleotide of the invention comprises a nucleotide sequence of any of SEQ ID NOs:1, 3, 5-8, 28, and 29. The present invention further provides recombinant nucleic acid molecules comprising disclosed sequences, including vectors and chimeric genes. Also provided are host cells comprising disclosed VDCC α₁ subunit sequences. A preferred host cell is a platelet or a megakaryocyte.

[0009] In another embodiment, the present invention provides an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide as described above. SEQ ID NOs:1-8, 28, and 29 set forth nucleotide and amino acid sequences from exemplary mammals, pig and human. More preferably, the antibody of the invention specifically binds a platelet calcium channel polypeptide comprising a human or porcine platelet VDCC α₁ subunit polypeptide. Even more preferably, an antibody of the invention specifically binds a polypeptide comprising an amino acid sequence of SEQ ID NO:2 or 4. Also provided by the present invention are antibodies that specifically bind homologues or biologically equivalent platelet VDCC α₁ subunit polypeptides. Optionally, an antibody of the invention is a monoclonal antibody.

[0010] In another aspect, the present invention provides a process of producing an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide as described above, the process comprising: (a) transfecting a recombinant host cell with a polynucleotide that encodes a biologically active platelet VDCC α₁ subunit polypeptide; (b) culturing the host cell under conditions sufficient for expression of the polypeptide; (c) recovering the polypeptide; and (d) preparing the antibody to the polypeptide. SEQ ID NOs:1-8, 28, and 29 set forth nucleotide and amino acid sequences from representative mammals, human and pig. Preferably, the host cell is transfected with a polynucleotide of any of SEQ ID NOs:1, 3, 5-8, 28, and 29. Even more preferably, the present invention provides an antibody prepared according to the process described above. Also provided by the present invention is the use of homologues or biologically equivalent platelet VDCC α₁ subunit polynucleotides and polypeptides found in other mammals to produce antibodies.

[0011] Alternatively, the present invention provides a process of detecting a platelet VDCC α₁ subunit polypeptide as described above, wherein the process comprises immunoreacting the polypeptide with an antibody prepared according to the process described above, forming an antibody-polypeptide conjugate wherein the antibody specifically binds the VDCC α₁ subunit polypeptide, and detecting the conjugate.

[0012] In another aspect, the present invention provides an assay or assay kit for detecting the presence of a platelet VDCC α₁ subunit polypeptide in a biological sample, where the kit comprises a first antibody capable of immunoreacting with a platelet VDCC α₁ subunit polypeptide. Preferably, the first antibody is present in an amount sufficient to perform at least one assay. Also preferably, an assay kit of the invention further comprises a second antibody that immunoreacts with the first antibody. More preferably, the antibodies used in an assay kit of the present invention are monoclonal antibodies. Even more preferably, the first antibody is affixed to a solid support. More preferably still, the first and second antibodies comprise an indicator, and, preferably, the indicator is a radioactive label, a fluorescent label or an enzyme.

[0013] In another embodiment, the present invention provides an assay or assay kit for detecting the presence, in a biological sample, of an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide, the kit comprising a first container containing a platelet VDCC α₁ subunit polypeptide that specifically binds the antibody, with the polypeptide present in an amount sufficient to perform at least one assay.

[0014] The present invention also provides a method for detecting a nucleic acid molecule that encodes a platelet VDCC α₁ subunit polypeptide in a biological sample. According to the method, a biological sample containing nucleic acid material is obtained, and a nucleic acid molecule of the present invention is hybridized under stringent conditions to the nucleic acid material of the sample. Under these conditions a hybridization duplex comprising a VDCC α₁ subunit sequence of the present invention and a complementary VDCC α₁ subunit sequence of the sample is formed. Detection of the hybridization duplex identifies a platelet VDCC α₁ subunit sequence in the biological sample.
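By way of illustration only, the base-pairing logic that underlies duplex formation can be sketched in silico. The following Python sketch is a simplified analogue and not the disclosed wet-laboratory method: it assumes perfect complementarity in place of stringent hybridization conditions, and the function names and the short probe and sample sequences are hypothetical and are not taken from any of SEQ ID NOs:1-8, 28, and 29.

```python
# Translation table for Watson-Crick complements of DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_duplex_sites(probe, sample_strand):
    """Return 0-based positions on sample_strand where probe can anneal.

    Only perfect matches are reported; this stands in for stringent
    conditions and is an assumption made for the sketch.
    """
    target = reverse_complement(probe)
    sample = sample_strand.upper()
    sites = []
    start = sample.find(target)
    while start != -1:
        sites.append(start)
        start = sample.find(target, start + 1)
    return sites


# Hypothetical probe and sample strand.
print(find_duplex_sites("GATTACA", "CCTGTAATCGG"))  # [2]
```

An empty result indicates that no perfectly complementary site exists on the examined strand; a practical analysis would also examine the opposite strand and tolerate mismatches according to the stringency employed.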

[0015] A method is further provided for identifying a mutation conferring altered platelet VDCC α₁ subunit activity. In one embodiment, the method includes the steps of amplifying nucleic acid molecules in a sample, and then evaluating whether a mutation is present in the amplified nucleic acid molecule. In another embodiment, a platelet VDCC α₁ subunit mutation is detected by evaluating a platelet VDCC α₁ subunit polypeptide in a biological sample.

[0016] Also provided is a method for detecting a polymorphism in a nucleic acid molecule that encodes a platelet VDCC α₁ subunit polypeptide. According to the method, a nucleic acid molecule in a sample is amplified using primers that selectively recognize a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide. A polymorphism can be identified in such an amplified platelet VDCC α₁ subunit sequence. The present invention also provides a kit for detecting a platelet VDCC α₁ subunit polymorphism.

[0017] In one embodiment, the present invention provides genetic assays based on the sequence of the platelet VDCC α₁ subunit genes. Platelet VDCC α₁ subunit sequences can be employed in the design of oligonucleotide primers suitable for the analysis of human genomic DNA. Thus, primers are used to screen for genetic variants by a number of PCR-based techniques, including single-strand conformation polymorphism (SSCP) analysis, SSCP/heteroduplex analysis, enzyme mismatch cleavage, and direct sequence analysis of amplified exons. Similar techniques can be applied to putative 5′-regulatory regions, e.g. the putative promoters 5′ of an exon of platelet VDCC α₁ subunit polypeptide. Automated methods can also be applied to the large-scale characterization of single nucleotide polymorphisms within and near a platelet VDCC α₁ subunit polypeptide.

[0018] Once genetic variants have been detected in specific populations, the present invention provides assays to detect the mutation by methods such as allele-specific hybridization, or restriction analysis of amplified genomic DNA containing the specific mutation. Again, these detection methods can be automated. In the case of genetic disease or human phenotypes caused by repeat expansion, the present invention provides an assay based on PCR of genomic DNA with oligonucleotide primers flanking the involved repeat.
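By way of illustration only, the readout of an assay that amplifies across a repeat with flanking primers can be sketched as follows. This Python sketch assumes that the amplified region has already been sequenced and that the repeat is a perfect tandem array; the function name, the flanking sequences, and the CAG repeat unit are hypothetical and are not drawn from the disclosed sequences.

```python
def repeat_count(template, upstream_flank, downstream_flank, unit):
    """Count tandem copies of unit between two flanks on one strand.

    Returns (copies, amplicon_length), where amplicon_length spans both
    flanks, as a product primed from the flanking sites would.
    """
    if not unit:
        raise ValueError("repeat unit must be non-empty")
    start = template.find(upstream_flank)
    if start == -1:
        raise ValueError("upstream flank not found")
    inner_start = start + len(upstream_flank)
    end = template.find(downstream_flank, inner_start)
    if end == -1:
        raise ValueError("downstream flank not found")
    region = template[inner_start:end]
    copies = 0
    # Walk along the region one repeat unit at a time.
    while region.startswith(unit, copies * len(unit)):
        copies += 1
    amplicon_length = end + len(downstream_flank) - start
    return copies, amplicon_length


# Hypothetical alleles: 5 copies versus 12 copies of the repeat unit.
normal = "ACGTAC" + "CAG" * 5 + "TTGGCA"
expanded = "ACGTAC" + "CAG" * 12 + "TTGGCA"
print(repeat_count(normal, "ACGTAC", "TTGGCA", "CAG"))    # (5, 27)
print(repeat_count(expanded, "ACGTAC", "TTGGCA", "CAG"))  # (12, 48)
```

In such a sketch the difference in amplicon length between alleles equals the number of added repeat units multiplied by the unit length, which is the property a size-based assay would rely upon.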

[0019] In another aspect, the present invention provides methods for identifying candidate substances for an ability to modulate platelet VDCC α₁ subunit activity. According to the method, a test sample is established comprising a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide, or functional portion thereof. A candidate substance is administered to the test sample, and an interaction, effect, or combination thereof, of the candidate substance on the test sample is assayed. Preferably, the candidate substance is a candidate protein, a peptide, an antibody, a chemical compound, or a nucleic acid. In one embodiment, the test sample used according to the method comprises a cell culture expressing a VDCC α₁ subunit polypeptide. The present invention also provides a recombinant cell line suitable for use in such a method. In another embodiment, the screening method comprises a modulatable transcriptional regulatory sequence of a platelet VDCC α₁ subunit polypeptide-encoding sequence. In this case, identification of a candidate substance as a modulator of platelet VDCC α₁ subunit activity is based on the amount of signal produced in relation to a control sample. An exemplary reporter gene encodes a platelet VDCC α₁ subunit polypeptide.
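By way of illustration only, the comparison of a test signal with a control signal can be sketched as a simple ratio. In the following Python sketch the function name, the two-fold threshold, and the signal values are assumptions chosen for the example; the present disclosure does not specify a threshold, and a practical screen would use replicate measurements and a statistical test.

```python
def classify_candidate(test_signal, control_signal, threshold=2.0):
    """Compare a test signal with a control signal.

    Returns (ratio, call). The two-fold default threshold is an
    illustrative assumption, not a value taken from the disclosure.
    """
    if not (control_signal > 0 and test_signal >= 0):
        raise ValueError("signals must be non-negative and control must be positive")
    if not threshold > 1:
        raise ValueError("threshold must be greater than 1")
    ratio = test_signal / control_signal
    if ratio >= threshold:
        return ratio, "increases reporter signal"
    if 1 / threshold >= ratio:
        return ratio, "decreases reporter signal"
    return ratio, "no call"


# Hypothetical readings in arbitrary units.
for test, control in [(300, 100), (40, 100), (120, 100)]:
    print(classify_candidate(test, control))
```

A candidate substance returning "no call" under these assumptions would not be scored as a modulator in this sketch, although a more sensitive assay might reach a different conclusion.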

[0020] In still a further embodiment, this invention pertains to therapeutic methods based upon the modulation of the biological activity of a platelet calcium channel via the polynucleotides and polypeptides described herein. According to the method, an effective amount of a substance that modulates platelet VDCC α₁ subunit activity is administered to a cell whereby the activity of platelet VDCC α₁ subunit is modulated in a predictable manner. Preferably, administration of the therapeutic composition modulates calcium transport in a cell. In one embodiment, therapeutic methods are provided wherein the substance administered is a modulator of VDCC α₁ subunit activity that was identified according to the disclosed methods, such a modulator including but not limited to a protein, a peptide, an antibody, a chemical compound, and a nucleic acid. In another embodiment, gene therapy approaches are provided using an isolated and purified polynucleotide of the present invention. Such methods of modulating the biological activity of a platelet calcium channel polypeptide are also applicable in the laboratory and/or clinical setting to enhance the capability to store or otherwise manipulate platelets for therapeutic or diagnostic purposes.

[0021] In yet another aspect, the present invention provides a genetically modified animal. In one embodiment of the present invention, the genetically modified animal can comprise a pig with targeted modification of a pig platelet VDCC α₁ subunit gene and can further comprise pig strains with complete or partial functional inactivation of the platelet VDCC α₁ subunit polypeptide genes in megakaryocytes and platelets. In an alternative embodiment, a genetically modified animal in accordance with the present invention is prepared using an anti-sense or ribozyme platelet VDCC α₁ subunit construct, driven by a universal, tissue-specific, or inducible promoter, to reduce levels of individual VDCCs in megakaryocytes and platelets, thus achieving a “knock-down” of individual isoforms.

[0022] The present invention also provides animal strains with specific “knocked-in” modifications in a platelet VDCC α₁ subunit gene. This includes pigs and mice with genetically and/or functionally relevant point mutations in the VDCC genes, in addition to manipulations such as the insertion of disease-specific repeat expansions. The present invention also provides the generation of animal strains with conditional inactivation of individual or multiple platelet VDCC α₁ subunit polypeptide genes by creation of a conditional mutation.

[0023] Thus, a key aspect of this invention pertains to the discovery of novel platelet VDCC α₁ subunit polypeptides and nucleic acids. Preferred nucleotide and amino acid sequences are described in SEQ ID NOs:1-8, 28, and 29. It is thus another aspect of this invention to provide a purified and isolated platelet VDCC α₁ subunit polypeptide having a role in the biological activity of calcium transport modulation.

[0024] The foregoing aspects and embodiments have broad utility given the biological significance of calcium transport in platelets. By way of example, the foregoing aspects and embodiments are useful in the preparation of screening assays and assay kits that are used to identify compounds that affect or modulate platelet calcium channel biological activity, or that are used to detect the presence of the proteins and nucleic acids of this invention in biological samples.

[0025] Accordingly, it is an object of the present invention to provide a novel platelet VDCC α₁ subunit polypeptide, and to provide a novel polynucleotide encoding the same. The object is achieved in whole or in part by the present invention.

[0026] An object of the invention having been stated herein above, other objects will become evident as the description proceeds when taken in connection with the accompanying Figures and Laboratory Examples as best described herein below.

BRIEF DESCRIPTION OF THE DRAWINGS

[0027] FIG. 1A is a schematic presentation of platelet VDCC α₁ subunit polypeptides cloned from human megakaryocytes and porcine platelets. A human platelet VDCC α₁S subunit clone (hα₁S, 3130 bp) from human megakaryocytes encodes a polypeptide between III S₄ and the CO₂⁻ end of the human α₁S. A 400 bp platelet VDCC α₁ subunit clone encodes the intracytoplasmic loop between motifs II-III. A human platelet VDCC α₁D subunit (hα₁D, 2682 bp) clone from human megakaryocytes encodes a polypeptide between IIIP and the CO₂⁻ end of the human α₁D subunit. A porcine platelet VDCC α₁S subunit polypeptide (pα₁S, 1031 bp) clone from porcine platelets encodes a polypeptide region between IV S₃ and aa 1531 of α₁S. A porcine platelet VDCC α₁D subunit polypeptide (pα₁D, 210 bp) clone from porcine platelets encodes a polypeptide between II S₃ and II S₅. =glutamate residue in cloned regions; +=positively charged residue in cloned regions; P=pore in cloned regions; Δ=residues that contribute to DHP binding.

[0028] FIG. 1B is a schematic depicting sequences absent from human α₁S and α₁D and porcine α₁S in cloned regions (shaded areas).

[0029] FIG. 1C is a photograph of gel electrophoresis of a PCR product (1033 bp) from human megakaryocytes that was produced using oligonucleotides 1 and 2 (Table 3) as primers to amplify cDNA regions. The cDNA clones obtained in this reaction had two different sequences. Clones 1 and 2 (lanes 1 and 2) were α₁S; clone 3 (lane 3) was α₁D. Left lane, DNA low molecular weight markers VI (2.1-0.5 kb, Boehringer Mannheim, Indianapolis, Ind.).

[0030] FIG. 1D is an autoradiograph of a Northern blot of mRNA from human megakaryocytes. 10 μg of mRNA/lane were probed with riboprobes synthesized from the PCR products described for FIG. 1C. cmRNA synthesized from clone 1 (α₁S) was used to probe the mRNA on the left lane and cmRNA from clone 3 (α₁D) was synthesized and used to probe the mRNA on the right lane. Arrows point to a 6.5 kb transcript in α₁S and a 10.8 kb transcript in α₁D. In addition, a 2.5 kb band hybridized with both probes, which are homologous but not identical. cmRNA is antisense mRNA. Bars on right are 9.4 kb, 7.46 kb, 4.4 kb, 2.37 kb molecular weight standards.

[0031] FIG. 2A is an autoradiograph of a Western blot on cultured megakaryocytes, which depicts that α₁S and α₁D subunits are expressed at the protein level in platelets and megakaryocytes. Lane a: the antiserum generated in rabbit against the peptide NEELRAIIKKIWKRTSMKLL (SEQ ID NO:27), located in the putative carboxyl end of α₁ subunits (arrow in FIG. 1), recognized its determinant in a 170 kDa polypeptide. Lane b: a monoclonal antibody, Mab 1A, previously described to recognize its determinant in α₁S from skeletal muscle was used to confirm that the 170 kDa polypeptide is α₁S. Lane c: normal rabbit IgG. Lane d: β cells from pancreas confirmed that the anti-peptide antibody recognized its determinant in α₁D (208 kDa) in β cells. Lane e: the monoclonal antibody Mab 1A did not recognize a determinant in α₁D. Lane f: normal non-immune rabbit serum. Bars on right are 218 kDa and 125 kDa molecular weight standards.

[0032] FIG. 2B is an autoradiograph of an immunoblot of immunoprecipitations on porcine platelets. The antipeptide antibody precipitated determinants in polypeptides with electrophoretic mobilities of 208 kDa and 170 kDa (lane a, arrows). The primary antibody was replaced with non-immune rabbit IgG in the immunoprecipitation control condition (lane b). Bars on right are 218 kDa and 125 kDa molecular weight standards.

[0033] FIG. 3 depicts a nucleotide sequence alignment of a human platelet VDCC α₁S subunit gene fragment of SEQ ID NO:5 and three closely related sequences, GenBank Accession No. XM_001910 (SEQ ID NO:9), GenBank Accession No. NM_00069 (SEQ ID NO:11), and GenBank Accession No. L33798 (SEQ ID NO:13).

[0034] FIG. 4 depicts a nucleotide sequence alignment of a porcine platelet VDCC α₁S subunit gene fragment of SEQ ID NO:7 and a human VDCC α₁S subunit fragment sequence, GenBank Accession No. XM_001910 (SEQ ID NO:9).

[0035] FIG. 5 depicts a nucleotide sequence alignment of a human platelet VDCC α₁D subunit fragment of SEQ ID NO:6 and three closely related sequences, GenBank Accession No. XM_003238 (SEQ ID NO:15), GenBank Accession No. NM_000720 (SEQ ID NO:17), and GenBank Accession No. M83566 (SEQ ID NO:19).

DETAILED DESCRIPTION OF THE INVENTION

[0036] Disclosed herein is the first evidence for the expression of VDCC α₁ subunits in platelets. The present invention teaches novel members of the VDCC gene family that are expressed in platelets. Representative embodiments are set forth in SEQ ID NOs:1-8, 28, and 29.

[0037] Thus, the present invention pertains to isolated and purified nucleic acids encoding platelet VDCC α₁ subunit polypeptides, to isolated and purified platelet VDCC α₁ subunit polypeptides, to the characterization of the role played by the platelet VDCC α₁ subunit polypeptides in modulating calcium levels within and outside cells, and to the characterization of upstream or downstream processes affected by such modulation (e.g. inside-out and/or outside-in signaling).

[0038] Summarily, the identification of sequences that encode platelet VDCC α₁ subunit polypeptides, the cloning of the corresponding cDNAs, and the expression of the corresponding proteins affords the molecular tools required for modulating calcium homeostasis in platelets, and has application in the development of diagnostic, pharmacological and/or therapeutic applications, including treatments for various bleeding, thrombotic, and related disorders in human and animal subjects.

[0039] Following long-standing patent law convention, the terms “a” and “an” mean “one or more” when used in this application, including the claims.

[0040] A. General Considerations

[0041] Diversity among VDCCs, expressed in their biophysical and pharmacological properties, has led to their classification into several categories (Birnbaumer L., et al., Neuron (1994) 13:505-506). The long-activating (L-type) dihydropyridine (DHP) sensitive VDCC is such a channel. cDNA cloning of its major, pore forming α₁ subunit from skeletal muscle, α₁S, has led to a proposed structural model for all α₁ subunits (Tanabe T., et al., Nature (1987) 328:313-318 and FIG. 1). This model predicts that the encoded polypeptide contains four homologous but not identical tandem motifs (I-IV), each of which is made up of six transmembrane regions (S₁-S₆). This α₁ subunit contains sequences that convey to the channel calcium selectivity, voltage and pharmacological sensitivities, gating properties, and susceptibility to bind with other subunits and neighboring proteins (Hille, B., Ionic channels of excitable membranes, Sunderland, Mass.: Sinauer Associate, Inc. Publishers (1992); Perez-Reyes, E. and Schneider, T., Kidney Int. (1995) 48:1111-1124). While α₁S is the major subunit from L-type VDCC in skeletal muscle, α₁D is that from neuro-endocrine cells (Birnbaumer L., et al., Neuron (1994) 13:505-506; Seino, S., et al., Proc Natl Acad Sci U S A. (1992) 89:584-588).
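By way of illustration only, the nomenclature of the structural model (four motifs of six transmembrane regions each) can be enumerated programmatically, which may assist in reading region designations such as “IV S₃-S₄ linker” used herein. The following Python sketch is merely a labeling aid; the function names are hypothetical and the sketch carries no sequence or structural data.

```python
from itertools import product

MOTIFS = ["I", "II", "III", "IV"]           # four homologous tandem motifs
SEGMENTS = [f"S{i}" for i in range(1, 7)]   # six transmembrane regions per motif


def transmembrane_labels():
    """Return labels 'I S1' through 'IV S6' in amino- to carboxyl-terminal order."""
    return [f"{motif} {segment}" for motif, segment in product(MOTIFS, SEGMENTS)]


def linker_label(motif, first_segment):
    """Name the loop joining two consecutive regions of one motif, e.g. 'IV S3-S4'."""
    if motif not in MOTIFS or first_segment not in range(1, 6):
        raise ValueError("expected a motif I-IV and a first region numbered 1-5")
    return f"{motif} S{first_segment}-S{first_segment + 1}"


labels = transmembrane_labels()
print(len(labels))            # 24
print(linker_label("IV", 3))  # IV S3-S4
```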

[0042] In accordance with the present invention, the expression and ultrastructural localization of the VDCCs in platelets and their progenitor cell, the megakaryocyte, has been investigated. The novel polypeptides and polynucleotides disclosed herein represent the first identification of a VDCC in platelets and megakaryocytes.

[0043] The gene structure of platelet VDCC α₁ subunits derived from human megakaryocytes and porcine platelets is depicted in FIG. 1. The characteristic feature of platelet VDCC α₁ subunits as compared with known VDCC α₁ subunits is indicated as a missing sequence in the IV S₃-S₄ linker. FIG. 2 demonstrates that platelet VDCC α₁S and α₁D subunit polypeptides are expressed in platelets and megakaryocytes.

[0044] B. Polypeptides and Polynucleotides

[0045] As used in the following detailed description and in the claims, the term “platelet VDCC” includes nucleic acids and polypeptides encoding calcium channels in platelets and megakaryocytes. Platelet VDCC α₁S subunit nucleic acid sequences are expressed in platelets and megakaryocytes, and optionally in other tissues. The term “platelet VDCC” specifically refers to a VDCC α₁ subunit characterized by the IV S₃-S₄ linker. Particularly, a platelet VDCC α₁S subunit polypeptide and a platelet VDCC α₁D subunit polypeptide are disclosed herein. Representative platelet VDCC α₁S subunits from human and porcine sources are disclosed in SEQ ID NOs:1, 2, 5, and 28 and SEQ ID NOs:7-8, respectively. Representative human platelet VDCC α₁D subunits are disclosed in SEQ ID NOs:3, 4, 6, and 29.

[0046] The term “platelet VDCC” further comprises vertebrate homologues of platelet calcium channel family members, including, but not limited to, mammalian and avian homologues. Representative mammalian homologues of platelet calcium channel family members include, but are not limited to, porcine and human homologues. VDCC α₁ subunit homologues are characterized by missing sequences that encode the IV S₃-S₄ linker region of VDCC α₁S polypeptides, as disclosed herein for human VDCC α₁S subunit, porcine VDCC α₁S subunit, and human VDCC α₁D subunit.

[0047] The terms “platelet VDCC α₁ subunit gene product”, “platelet VDCC α₁ subunit protein”, and “platelet VDCC α₁ subunit polypeptide” refer to peptides having amino acid sequences which are substantially identical to native amino acid sequences from the organism of interest and which are biologically active in that they comprise the amino acid sequence of a platelet VDCC α₁ subunit polypeptide, or cross-react with antibodies that specifically bind a platelet VDCC α₁ subunit polypeptide, or retain all or some of the biological activity of the native amino acid sequence or protein. Such biological activity can include immunogenicity. As disclosed herein below, an important feature of platelet VDCC α₁ subunits is the absence of sequences encoding a portion of the IV S₃-S₄ linker of VDCC α₁ subunit polypeptides.

[0048] The terms “platelet VDCC α₁ subunit gene product”, “platelet VDCC α₁ subunit protein”, and “platelet VDCC α₁ subunit polypeptide” are preferably meant to encompass a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide, including but not limited to those disclosed herein as SEQ ID NOs:2 and 4. Indeed, the definitions and explanations of “platelet VDCC α₁ subunit gene product”, “platelet VDCC α₁ subunit protein”, and “platelet VDCC α₁ subunit polypeptide” presented herein are also meant to be applied to a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide.

[0049] The terms “platelet VDCC α₁ subunit gene product”, “platelet VDCC α₁ subunit protein”, and “platelet VDCC α₁ subunit polypeptide” also include analogs of a platelet VDCC α₁ subunit polypeptide. By “analog” is intended that a DNA or peptide sequence can contain alterations relative to the sequences disclosed herein, yet retain all or some of the biological activity of those sequences. Analogs can be derived from nucleotide sequences as are disclosed herein or from other organisms, or can be created synthetically. Those skilled in the art will appreciate that other analogs, as yet undisclosed or undiscovered, can be used to design and/or construct calcium channel analogs. There is no need for a “platelet VDCC α₁ subunit gene product”, “platelet VDCC α₁ subunit protein”, and “platelet VDCC α₁ subunit polypeptide” to comprise all or substantially all of the amino acid sequence of a native platelet VDCC α₁ subunit polypeptide gene product. Shorter or longer sequences are anticipated to be of use in the invention; shorter sequences are herein referred to as “segments.” Thus, the terms “platelet VDCC α₁ subunit gene product”, “platelet VDCC α₁ subunit protein”, and “platelet VDCC α₁ subunit polypeptide” also include fusion or recombinant platelet VDCC α₁ subunit polypeptides and proteins comprising sequences of the present invention. Methods of preparing such proteins are known in the art.

[0050] The terms “platelet VDCC α₁ subunit gene”, “platelet VDCC α₁ subunit gene segment”, “platelet VDCC α₁ subunit gene sequence”, “platelet VDCC α₁ subunit polynucleotide”, “platelet VDCC α₁ subunit nucleic acid molecule”, and “platelet VDCC α₁ subunit nucleic acid sequence” refer to any nucleic acid sequence (e.g. a DNA sequence) that is substantially identical to a polynucleotide sequence encoding a platelet VDCC α₁ subunit gene product, platelet VDCC α₁ subunit protein, or platelet VDCC α₁ subunit polypeptide as defined above, and can also comprise any combination of associated control sequences. The terms also refer to RNA, or antisense sequences, complementary to such DNA sequences. As used herein, the term “DNA segment” refers to a DNA molecule that has been isolated free of total genomic DNA of a particular species. Furthermore, a DNA segment encoding a platelet VDCC α₁ subunit polypeptide refers to a DNA segment that contains platelet VDCC α₁ subunit coding sequences, yet is isolated away from, or purified free from, total genomic DNA of a source species, such as Homo sapiens. Included within the term “DNA segment” are DNA segments and smaller fragments of such segments, and also recombinant vectors, including, for example, plasmids, cosmids, phages, viruses, and the like.

[0051] The terms “platelet VDCC α₁ subunit gene”, “platelet VDCC α₁ subunit gene segment”, “platelet VDCC α₁ subunit gene sequence”, “platelet VDCC α₁ subunit polynucleotide”, “platelet VDCC α₁ subunit nucleic acid molecule”, and “platelet VDCC α₁ subunit nucleic acid sequence” are preferably meant to encompass a polynucleotide encoding a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide, including but not limited to those disclosed herein as SEQ ID NOs:1, 3, 5-8, 28, and 29. Indeed, the definitions and explanations of the terms “platelet VDCC α₁ subunit gene”, “platelet VDCC α₁ subunit gene segment”, “platelet VDCC α₁ subunit gene sequence”, “platelet VDCC α₁ subunit polynucleotide”, “platelet VDCC α₁ subunit nucleic acid molecule”, and “platelet VDCC α₁ subunit nucleic acid sequence” presented herein are also meant to be applied to a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide.

[0052] A characteristic feature of the platelet VDCC α₁S subunits of the present invention is the absence of sequences that encode a portion of the IV S₃-S₄ linker of known VDCC α₁S subunits. The sequences that are lacking in the disclosed platelet VDCC α₁S subunits are set forth as SEQ ID NOs:23-24 (α₁S) and SEQ ID NOs:25-26 (α₁D). The absence of these sequences from the platelet VDCC α₁S subunits of the present invention is depicted in FIGS. 1A, 1B, and 3-5.

[0053] FIGS. 1A and 1B show a schematic representation of the sequences that are absent from human and porcine platelet VDCC α₁ subunits in the region of the IV S₃-S₄ linker.

[0054] FIG. 3 depicts a nucleotide sequence alignment of a human platelet VDCC α₁S subunit fragment of SEQ ID NO:5 and three closely related sequences, GenBank Accession No. XM_001910 (SEQ ID NO:9), GenBank Accession No. NM_00069 (SEQ ID NO:11), and GenBank Accession No. L33798 (SEQ ID NO:13). The nucleotide sequence of SEQ ID NO:5 represents an individual sequence read of a cloned human platelet VDCC α₁S subunit fragment. When using this fragment as a query sequence, BLAST analysis indicated the three above-mentioned GenBank sequences as having the highest level of sequence identity with platelet VDCC α₁S subunit sequences of the present invention. The alignment further reveals the absence of an about 57 base pair sequence (SEQ ID NO:23) in the human platelet VDCC α₁S subunit fragment of SEQ ID NO:5 compared to known VDCC α₁S subunit sequences. The deleted sequence is revealed by disruption of each alignment between contiguous base pairs 48 and 49 of the query sequence, and substantial sequence conservation is observed over the remainder of the query sequence.

[0055] FIG. 4 depicts a nucleotide sequence alignment of a porcine platelet VDCC α₁S subunit gene fragment of SEQ ID NO:7 and a human VDCC α₁S subunit fragment sequence, GenBank Accession No. XM_001910 (SEQ ID NO:9). The nucleotide sequence of SEQ ID NO:7 represents an individual sequence read of a cloned porcine platelet VDCC α₁S subunit fragment. When using this fragment as a query sequence, BLAST analysis indicated human VDCC α₁S subunit gene (GenBank Accession No. 001910, SEQ ID NO:9) and rabbit dihydropyridine receptor (GenBank Accession No. X05921, SEQ ID NO:21) as being the most closely related sequences (Table 1). The alignment further reveals the absence of an about 57 base pair sequence (SEQ ID NO:23) in the porcine platelet VDCC α₁S subunit fragment of SEQ ID NO:7, as the alignment is disrupted between base pairs 60 and 67 of the query sequence, but substantial sequence conservation is observed over the remainder of the query sequence.

TABLE 1

SEQ ID NO.  Applicant's reference       Best BLAST hit (accession)  Score (bits)  E value  Identities
7           990210Db29 pa1S1511-1325    XM001910                    761           0.0      516/599 (92%)
8           990208Cb18 pa1S1172-1365    X05921                      563           0.0      395/432 (91%)

[0056] A comparison of the predicted amino acid sequences encoded by the human and porcine platelet VDCC α₁S subunit fragments demonstrates that the deleted sequence, relative to known VDCC α₁S subunit sequences, is similarly positioned in human and porcine platelet VDCC α₁S subunit sequences (FIGS. 1A and 1B).

[0057] FIG. 5 depicts a nucleotide sequence alignment of a human platelet VDCC α₁D subunit gene fragment of SEQ ID NO:6 and three closely related sequences, GenBank Accession No. XM_003238 (SEQ ID NO:15), GenBank Accession No. NM_000720 (SEQ ID NO:17), and GenBank Accession No. M83566 (SEQ ID NO:19). The nucleotide sequence of SEQ ID NO:6 represents an individual sequence read of a cloned human platelet VDCC α₁D subunit fragment. When using this fragment as a query sequence, BLAST analysis indicated the three above-mentioned GenBank sequences as having the highest level of sequence identity with platelet VDCC α₁D subunit sequences of the present invention. The alignment further reveals the absence of an about 48 base pair sequence (SEQ ID NO:25) in the human platelet VDCC α₁D subunit fragment of SEQ ID NO:6 as compared to known sequences. The deleted sequence is revealed by disruption of each alignment between contiguous base pairs 44 and 49 of the query sequence. Substantial sequence conservation is observed over the remainder of the query sequence.
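By way of illustration only, the way an alignment exposes a missing internal segment can be sketched in a few lines of Python. The sketch assumes, as a simplification, that the query is identical to the reference apart from one contiguous deletion; the actual alignments of FIGS. 3-5 also contain mismatches and were produced with BLAST, not with this routine. The function name and the toy sequences are hypothetical and are not sequences of the present invention.

```python
def locate_single_deletion(reference: str, query: str):
    """Return (start, deleted_segment) for one contiguous deletion.

    Assumes the query equals the reference except for a single missing
    internal stretch (an illustrative simplification).
    """
    if len(query) > len(reference):
        raise ValueError("query must not be longer than reference")

    # Length of the prefix shared by both sequences.
    prefix = 0
    while prefix < len(query) and reference[prefix] == query[prefix]:
        prefix += 1

    # Length of the shared suffix, never overlapping the shared prefix.
    suffix = 0
    while (suffix < len(query) - prefix
           and reference[len(reference) - 1 - suffix]
           == query[len(query) - 1 - suffix]):
        suffix += 1

    if prefix + suffix != len(query):
        raise ValueError("sequences differ by more than one contiguous deletion")

    return prefix, reference[prefix:len(reference) - suffix]


# Hypothetical toy sequences, not taken from any SEQ ID NO.
toy_reference = "ATGGCCTTTAAACCCGGGTGA"
toy_query = "ATGGCCGGGTGA"
start, missing = locate_single_deletion(toy_reference, toy_query)
print(start, missing)
```

Here the alignment is disrupted after position 6 of the toy query, and the nine reference bases with no counterpart in the query are reported as the deleted segment.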

[0058] B.1. Sequence Similarity and Identity

[0059] As used herein, the term “substantially similar” means that a particular sequence varies from a platelet VDCC α₁ subunit nucleic acid sequence, or a platelet VDCC α₁ subunit amino acid sequence, by one or more deletions, substitutions, or additions, the net effect of which is to retain at least some of the biological activity of the natural gene, gene product, or sequence. For example, a “substantially similar” polypeptide can exhibit decreased or increased biological activity, as in a pathological or disease- or dysfunction-causing condition. Such sequences include “mutant” or “polymorphic” sequences, or sequences in which the biological activity is altered to some degree but which retain at least some of the original biological activity. A critical measure of substantial identity of a platelet VDCC α₁ subunit is the absence of nucleotide sequences encoding, and amino acid sequences comprising, a region of the IV S₃-S₄ linker of VDCC α₁ subunit polypeptides (SEQ ID NOs:23-26).

[0060] Nucleic acids that are substantially identical to SEQ ID NOs:1, 3, 5-8, 28, and 29 are preferred platelet VDCC α₁ subunit sequences. Such nucleic acids, e.g. allelic variants, genetically altered versions of the gene, etc., bind to the provided platelet VDCC α₁ subunit sequences under stringent hybridization conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate homologous or related genes. The source of homologous genes can be any species, e.g. primate species, rodents (such as rats and mice), canines, felines, bovines, equines, etc.

[0061] Between mammalian species, e.g. human and pig, homologues have substantial sequence similarity, i.e. at least 75% sequence identity between nucleotide sequences, and more preferably at least 90% sequence identity. Sequence similarity is calculated based on a reference sequence, which can be a subset of a larger sequence, such as a conserved motif, coding region, flanking region, etc. A reference sequence will usually be at least about 18 nt long, more usually at least about 30 nt long, and can extend to the complete sequence that is being compared. Algorithms for sequence analysis are known in the art, such as BLAST, described in Altschul et al. (1990) J. Mol. Biol. 215: 403-10. In determining nucleic acid sequences, all subject nucleic acid sequences capable of encoding substantially similar amino acid sequences are considered to be substantially similar to a reference nucleic acid sequence, regardless of differences in codon sequences or substitution of equivalent amino acids to create biologically functional equivalents.

[0062] Percent identity or percent similarity of a DNA or peptide sequence can be determined, for example, by comparing sequence information using the GAP computer program, available from the University of Wisconsin Genetics Computer Group. The GAP program utilizes the alignment method of Needleman et al., (1970) J. Mol. Biol. 48: 443, as revised by Smith et al., (1981) Adv. Appl. Math. 2: 482. Briefly, the GAP program defines similarity as the number of aligned symbols (i.e., nucleotides or amino acids) which are similar, divided by the total number of symbols in the shorter of the two sequences. The preferred parameters for the GAP program are the default parameters, which do not impose a penalty for end gaps. See Schwartz et al., eds., (1979), Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, pp. 357-358; Gribskov et al., (1986) Nucl. Acids. Res. 14: 6745.
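The ratio described above (matching aligned symbols divided by the number of symbols in the shorter sequence) can be illustrated with a minimal Python sketch. It is not the GAP program: it assumes the two sequences have already been aligned, with gaps written as “-”, and it counts only identical symbols, not merely similar ones. The function name and the example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Identical aligned symbols as a percentage of the shorter sequence.

    Both arguments must be rows of the same pre-computed alignment, so they
    have equal length once gap characters are included.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    # Count columns where both rows carry the same non-gap symbol.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != gap)

    # Ungapped lengths of the two sequences; the shorter is the denominator.
    len_a = sum(1 for x in aligned_a if x != gap)
    len_b = sum(1 for y in aligned_b if y != gap)
    shorter = min(len_a, len_b)
    if shorter == 0:
        raise ValueError("sequences must contain at least one symbol")

    return 100.0 * matches / shorter


# Hypothetical aligned rows: one mismatch in four columns.
print(percent_identity("ACGT", "ACGA"))
```

Because the denominator is the length of the shorter sequence, a query that lacks an internal segment but otherwise matches the reference still scores 100% under this definition.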

[0063] The term “similarity” is contrasted with the term “identity”. Similarity is defined as above; “identity”, however, means a nucleic acid or amino acid sequence having the same amino acid at the same relative position in a given family member of a gene family. Homology and similarity are generally viewed as broader terms than the term identity. Biochemically similar amino acids, for example leucine and isoleucine or glutamate/aspartate, can be present at the same position—these are not identical per se, but are biochemically “similar.” As disclosed herein, these are referred to as conservative differences or conservative substitutions. This differs from a conservative mutation at the DNA level, which changes the nucleotide sequence without making a change in the encoded amino acid, e.g. TCC to TCA, both of which encode serine.

[0064] As used herein, DNA analog sequences are “substantially identical” to specific DNA sequences disclosed herein if: (a) the DNA analog sequence is derived from coding regions of the nucleic acid sequence shown in any of SEQ ID NOs:1, 3, 5-8, 28, and 29 and lacks sequences of SEQ ID NOs:23 and 25; or (b) the DNA analog sequence is capable of hybridization to any of SEQ ID NOs:1, 3, 5-8, 28, and 29 under stringent conditions, lacks sequences of SEQ ID NOs:23 and 25, and encodes a biologically active gene product of the nucleic acid sequence shown in any of SEQ ID NOs:1, 3, 5-8, 28, and 29; or (c) the DNA sequences are degenerate as a result of alternative genetic code to the DNA analog sequences defined in (a) and/or (b). Substantially identical analog proteins will be greater than about 60% identical to the corresponding sequence of the native protein. Sequences having lesser degrees of identity but comparable biological activity are considered to be equivalents.

[0065] As used herein, “stringent conditions” means conditions of high stringency, for example 6×SSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% bovine serum albumin, 0.1% sodium dodecyl sulfate, 100 μg/mL salmon sperm DNA and 15% formamide at 68° C. For the purposes of specifying additional conditions of high stringency, preferred conditions are salt concentration of about 200 mM and temperature of about 45° C. One example of such stringent conditions is hybridization at 4×SSC, at 65° C., followed by a washing in 0.1×SSC at 65° C. for one hour. Another exemplary stringent hybridization scheme uses 50% formamide, 4×SSC at 42° C. Stringent features are understood to be able to detect a remoter address.

[0066] In contrast, nucleic acids having sequence similarity are detected by hybridization under lower stringency conditions. Thus, sequence similarity can be determined by hybridization under lower stringency conditions, for example, at 50° C. and 10×SSC (0.9 M NaCl/0.09 M sodium citrate), wherein the sequences remain bound when subjected to washing at 55° C. in 1×SSC.

[0067] Thus, in certain embodiments, the invention concerns the use of platelet VDCC α₁ subunit genes and gene products that include within their respective sequences a sequence which is essentially that of a platelet VDCC α₁ subunit gene, or the corresponding protein. The term “a sequence essentially as that of a platelet VDCC α₁ subunit gene” means that the sequence is substantially identical to a portion of a platelet VDCC α₁ subunit gene and contains a minority of bases or amino acids (whether DNA or protein) which are not identical to those of a platelet VDCC α₁ subunit protein or a platelet VDCC α₁ subunit gene, or which are not a biologically functional equivalent. The term “biologically functional equivalent” is well understood in the art and is further defined in detail herein.

[0068] Nucleotide sequences are “substantially identical” where they have between about 70% and about 80%, or more preferably between about 81% and about 90%, or even more preferably between about 91% and about 99%, of nucleic acid residues identical to the nucleotide sequence of a platelet VDCC α₁ subunit gene. Gene structure is also a useful measure of substantially identical sequences. In this case, the omission of sequences encoding a region of the IV S₃-S₄ linker of VDCC α₁ subunit polypeptides is a critical feature of the disclosed sequences.

[0069] Peptide sequences which have about 35%, or 45%, or preferably from 45-55%, or more preferably 55-65%, or most preferably 65% or greater amino acids which are identical or functionally equivalent or biologically functionally equivalent to the amino acids of a platelet VDCC α₁ subunit polypeptide will be sequences which are “substantially similar”. Peptide structure is also a useful indicator of substantially identical sequences. The present invention discloses sequences that have a shorter IV S₃-S₄ linker compared to known VDCC α₁ subunit polypeptides. Thus, this is an important feature when considering substantial similarity among VDCC α₁ subunit sequences.

[0070] Platelet VDCC α₁ subunit gene products and platelet VDCC α₁ subunit-encoding nucleic acid sequences which have functionally equivalent codons are also covered by the invention. The term “functionally equivalent codon” is used herein to refer to codons that encode the same amino acid, such as the AGC and AGU codons for serine (Table 2). Thus, when referring to the sequence examples presented in SEQ ID NOs:1, 3, 5-8, 28, and 29, applicants contemplate substitution of functionally equivalent codons of Table 2 into the sequence examples of SEQ ID NOs:1, 3, 5-8, 28, and 29. Thus, applicants are in possession of amino acid and nucleic acid sequences which include such substitutions but which are not set forth herein in their entirety for convenience.

TABLE 2

Functionally Equivalent Codons

Amino Acids              Codons
Alanine        Ala  A    GCA GCC GCG GCU
Cysteine       Cys  C    UGC UGU
Aspartic Acid  Asp  D    GAC GAU
Glutamic Acid  Glu  E    GAA GAG
Phenylalanine  Phe  F    UUC UUU
Glycine        Gly  G    GGA GGC GGG GGU
Histidine      His  H    CAC CAU
Isoleucine     Ile  I    AUA AUC AUU
Lysine         Lys  K    AAA AAG
Leucine        Leu  L    UUA UUG CUA CUC CUG CUU
Methionine     Met  M    AUG
Asparagine     Asn  N    AAC AAU
Proline        Pro  P    CCA CCC CCG CCU
Glutamine      Gln  Q    CAA CAG
Arginine       Arg  R    AGA AGG CGA CGC CGG CGU
Serine         Ser  S    AGC AGU UCA UCC UCG UCU
Threonine      Thr  T    ACA ACC ACG ACU
Valine         Val  V    GUA GUC GUG GUU
Tryptophan     Trp  W    UGG
Tyrosine       Tyr  Y    UAC UAU
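As an illustrative aid, the notion of functionally equivalent codons can be expressed as a lookup over the standard genetic code. The following Python sketch is one possible encoding, not part of the disclosed methods; the names `CODONS`, `amino_acid`, and `functionally_equivalent` are hypothetical, and DNA codons written with T are converted to the RNA alphabet before lookup.

```python
# Sense codons of the standard genetic code, grouped by amino acid (RNA alphabet).
CODONS = {
    "Ala": "GCA GCC GCG GCU", "Cys": "UGC UGU", "Asp": "GAC GAU",
    "Glu": "GAA GAG", "Phe": "UUC UUU", "Gly": "GGA GGC GGG GGU",
    "His": "CAC CAU", "Ile": "AUA AUC AUU", "Lys": "AAA AAG",
    "Leu": "UUA UUG CUA CUC CUG CUU", "Met": "AUG", "Asn": "AAC AAU",
    "Pro": "CCA CCC CCG CCU", "Gln": "CAA CAG",
    "Arg": "AGA AGG CGA CGC CGG CGU", "Ser": "AGC AGU UCA UCC UCG UCU",
    "Thr": "ACA ACC ACG ACU", "Val": "GUA GUC GUG GUU", "Trp": "UGG",
    "Tyr": "UAC UAU",
}

# Invert the table: codon -> three-letter amino acid code.
CODON_TO_AA = {codon: aa for aa, codons in CODONS.items()
               for codon in codons.split()}


def amino_acid(codon: str) -> str:
    """Return the amino acid encoded by an RNA or DNA sense codon."""
    return CODON_TO_AA[codon.upper().replace("T", "U")]


def functionally_equivalent(codon_a: str, codon_b: str) -> bool:
    """True when both codons encode the same amino acid."""
    return amino_acid(codon_a) == amino_acid(codon_b)


print(functionally_equivalent("AGC", "AGU"))  # both encode serine
print(functionally_equivalent("TCC", "TCA"))  # DNA spelling, both serine
```

Stop codons are deliberately absent from the table, so looking one up raises a `KeyError` in this sketch.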

[0071] It will also be understood by those of skill in the art that amino acid and nucleic acid sequences can include additional residues, such as additional N- or C-terminal amino acids or 5′ or 3′ nucleic acid sequences, and yet still be essentially as set forth in one of the sequences disclosed herein, so long as the sequence retains biological protein activity where protein expression is concerned. The addition of terminal sequences particularly applies to nucleic acid sequences which can, for example, include various non-coding sequences flanking either of the 5′ or 3′ portions of the coding region or can include various internal sequences, i.e., introns, which are known to occur within genes.

[0072] The present invention also encompasses the use of nucleotide segments that are complementary to the sequences of the present invention. Nucleic acid sequences which are “complementary” are those which are base-paired according to the standard Watson-Crick complementarity rules. As used herein, the term “complementary sequences” means nucleic acid sequences which are substantially complementary, as can be assessed by the same nucleotide comparison set forth above, or which are capable of hybridizing to the nucleic acid segment in question under moderately stringent conditions such as those described herein. A particular example of a provided complementary nucleic acid segment is an antisense oligonucleotide. In this case, complementarity is defined by both cross-hybridization and the lack of coding sequences of SEQ ID NOs:23 and 25.
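For illustration, standard Watson-Crick pairing (A with T, G with C) can be captured in a short Python sketch that derives the perfectly complementary strand of a DNA sequence. This covers exact complementarity only; substantial complementarity as defined above tolerates mismatches and is assessed by sequence comparison or hybridization. The function names and the example sequence are hypothetical.

```python
# Translation table implementing Watson-Crick pairing for DNA.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'.

    Characters other than A, C, G, T are passed through unchanged.
    """
    return dna.translate(_COMPLEMENT)[::-1]


def is_perfect_complement(strand_a: str, strand_b: str) -> bool:
    """True when strand_b pairs with strand_a at every position."""
    return reverse_complement(strand_a).upper() == strand_b.upper()


# Hypothetical six-base example.
print(reverse_complement("ATGGCC"))
```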

[0073] Hybridization can be used to assess complementary sequences and/or to isolate complementary nucleotide sequences. Nucleic acid hybridization will be affected by such conditions as salt concentration, temperature, or organic solvents, in addition to the base composition, length of the complementary strands, and the number of nucleotide base mismatches between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the art. Stringent temperature conditions will generally include temperatures in excess of about 30° C., typically in excess of about 37° C., and preferably in excess of about 45° C. Stringent salt conditions will ordinarily be less than about 1,000 mM, typically less than about 500 mM, and preferably less than about 200 mM. However, the combination of parameters is much more important than the measure of any single parameter. (See e.g., Wetmur & Davidson, 1968). Determining appropriate hybridization conditions to identify and/or isolate sequences containing high levels of identity is well known in the art. (See e.g., Sambrook et al., 1989).

[0074] Representative moderate stringency conditions comprise, for example, hybridization at 50° C. and in 10×SSC (0.9 M NaCl/0.09 M sodium citrate), wherein the hybridized nucleic acid molecules remain bound when subjected to washing at 55° C. in 1×SSC. Sequence identity can be further determined by hybridization under more stringent conditions, for example, at 50° C. or higher and 0.1×SSC (9 mM NaCl/0.9 mM sodium citrate), as described below. Typically, under “stringent conditions” a probe will hybridize specifically to its target sequence, but to no other sequences.

[0075] For the purposes of specifying conditions of high stringency, preferred conditions are salt concentration of about 200 mM and temperature of about 45° C. One example of such stringent conditions is hybridization at 4×SSC, at 65° C., followed by a washing in 0.1×SSC at 65° C. for one hour. Another representative stringent hybridization scheme uses 50% formamide, 4×SSC at 42° C. As used herein, “stringent conditions” can also mean conditions of high stringency, for example 6×SSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% bovine serum albumin, 0.1% sodium dodecyl sulfate, 100 μg/mL salmon sperm DNA and 15% formamide at 68° C.

[0076] Nucleic acids that are substantially identical to the provided platelet VDCC α₁ subunit sequences, e.g. allelic variants, genetically altered versions of the gene, polymorphic sequences, etc., bind to the provided platelet VDCC α₁ subunit sequences under stringent hybridization conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate homologous or related genes. The source of homologous genes can be any species, e.g. primate species, rodents (such as rats and mice), canines, felines, bovines, ovines, equines, etc.

[0077] Between mammalian species, e.g. human and pig, homologues display similar gene structure, have substantial sequence similarity, i.e. at least 75% sequence identity between nucleotide sequences, more preferably greater than 90% sequence similarity, and specifically lack nucleotide sequences set forth as SEQ ID NOs:23 and 25. Sequence similarity is calculated based on a reference sequence, which can be a subset of a larger sequence, such as a conserved motif, coding region, flanking region, etc. A reference sequence will usually be at least about 18 nt long, more usually at least about 30 nt long, and can extend to the complete sequence that is being compared. Algorithms for sequence analysis are known in the art, such as BLAST, described in Altschul et al. (1990) J Mol Biol 215:403-10. Another commonly used alignment program is entitled CLUSTAL W and is described in Nucleic Acids Res (1994) November 11;22(22):4673-80, among other places. The sequences provided herein are useful for recognizing platelet VDCC α₁ subunit related and homologous proteins in database searches.

[0078] At a biological level, identity is just that, i.e. the same amino acid at the same relative position in a given family member of a gene family. Homology and similarity are generally viewed as broader terms. For example, biochemically similar amino acids, for example leucine and isoleucine or glutamate/aspartate, can be present at the same position—these are not identical per se, but are biochemically “similar”. As disclosed herein, these are referred to as conservative differences or conservative substitutions. This differs from a conservative mutation at the DNA level, which changes the nucleotide sequence without making a change in the encoded amino acid, e.g. TCC to TCA, both of which encode serine.

[0079] The platelet VDCC α₁ subunit genes disclosed herein thus encode homologous proteins, but when percentages are referred to herein, they refer to percent identity.

[0080] Probe sequences can also hybridize specifically to duplex DNA under certain conditions to form triplex or other higher order DNA complexes. The preparation of such probes and suitable hybridization conditions are disclosed herein and are known in the art.

[0081] As used herein and in the claims, the term “gene” refers to both genomic sequences and cDNA sequences. “Isolated substantially away from other coding sequences” means that the gene of interest, in this case, a platelet VDCC α₁ subunit gene, forms the significant part of the coding region of the DNA segment, and that the DNA segment does not contain large portions of naturally-occurring coding DNA, such as large chromosomal fragments or other functional genes or cDNA coding regions. Of course, this refers to the DNA segment as originally isolated, and does not exclude genes or coding regions later added to the segment by the hand of man.

[0082] Thus, in particular embodiments, the invention concerns isolated DNA segments and recombinant vectors incorporating DNA sequences which encode a platelet VDCC α₁ subunit polypeptide that includes within its amino acid sequence an amino acid sequence of the present invention. In other particular embodiments, the invention concerns recombinant vectors incorporating DNA segments which encode a protein comprising the amino acid sequence of a human platelet VDCC α₁ subunit polypeptide protein.

[0083] B.2. Biologically Functional Equivalents

[0084] As mentioned above, modifications and changes can be made in the structure of the platelet VDCC α₁ subunit polypeptide proteins and peptides described herein and still constitute a molecule having like or otherwise desirable characteristics. For example, certain amino acids can be substituted for other amino acids in a protein structure without appreciable loss of interactive capacity with, for example, structures in the nucleus of a cell. Since it is the interactive capacity and nature of a protein that defines that protein's biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence (or the nucleic acid sequence encoding it) to obtain a protein with the same, enhanced, or antagonistic properties. Such properties can be achieved by interaction with the normal targets of the native protein, but this need not be the case, and the biological activity of the invention is not limited to a particular mechanism of action. It is thus provided in accordance with the present invention that various changes can be made in the sequence of the platelet VDCC α₁ subunit polypeptide proteins and peptides or underlying nucleic acid sequence without appreciable loss of their biological utility or activity.

[0085] Biologically functional equivalent peptides, as used herein, are peptides in which certain, but not most or all, of the amino acids can be substituted. Thus, when referring to the sequence examples presented in SEQ ID NOs:1, 3, 5-8, 28, and 29, applicants envision substitution of codons that encode biologically equivalent amino acids as described herein into the sequence examples of SEQ ID NOs:1, 3, 5-8, 28, and 29. Thus, applicants are in possession of amino acid and nucleic acids sequences which include such substitutions but which are not set forth herein in their entirety for convenience.

[0086] Alternatively, functionally equivalent proteins or peptides can be created via the application of recombinant DNA technology, in which changes in the protein structure can be engineered, based on considerations of the properties of the amino acids being exchanged, e.g. substitution of Ile for Leu. Changes designed by man can be introduced through the application of site-directed mutagenesis techniques, e.g., to introduce improvements to the antigenicity of the protein or to test platelet VDCC α₁ subunit mutants in order to examine platelet VDCC α₁ subunit calcium transport activity, or other activity at the molecular level.

[0087] Amino acid substitutions, such as those which might be employed in modifying the platelet VDCC α₁ subunit polypeptide proteins and peptides described herein, are generally based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. An analysis of the size, shape and type of the amino acid side-chain substituents reveals that arginine, lysine and histidine are all positively charged residues; that alanine, glycine and serine are all of similar size; and that phenylalanine, tryptophan and tyrosine all have a generally similar shape. Therefore, based upon these considerations, arginine, lysine and histidine; alanine, glycine and serine; and phenylalanine, tryptophan and tyrosine; are defined herein as biologically functional equivalents. Other biologically functionally equivalent changes will be appreciated by those of skill in the art.

[0088] In making biologically functional equivalent amino acid substitutions, the hydropathic index of amino acids can be considered. Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics; these are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine (+2.5); methionine (+1.9); alanine (+1.8); glycine (−0.4); threonine (−0.7); serine (−0.8); tryptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2); glutamate (−3.5); glutamine (−3.5); aspartate (−3.5); asparagine (−3.5); lysine (−3.9); and arginine (−4.5).

[0089] The importance of the hydropathic amino acid index in conferring interactive biological function on a protein is generally understood in the art (Kyte & Doolittle, 1982, incorporated herein by reference). It is known that certain amino acids can be substituted for other amino acids having a similar hydropathic index or score and still retain a similar biological activity. In making changes based upon the hydropathic index, the substitution of amino acids whose hydropathic indices are within ±2 of the original value is preferred, those which are within ±1 of the original value are particularly preferred, and those within ±0.5 of the original value are even more particularly preferred.
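The preference tiers just described can be illustrated with a small Python sketch that compares the hydropathic indices listed above for a proposed substitution. It is offered only as an aid to understanding; the function name `hydropathy_tier` and the returned labels are hypothetical, and a favorable tier does not by itself establish retained biological activity.

```python
# Hydropathic indices as listed above (three-letter amino acid codes).
HYDROPATHY = {
    "Ile": 4.5, "Val": 4.2, "Leu": 3.8, "Phe": 2.8, "Cys": 2.5,
    "Met": 1.9, "Ala": 1.8, "Gly": -0.4, "Thr": -0.7, "Ser": -0.8,
    "Trp": -0.9, "Tyr": -1.3, "Pro": -1.6, "His": -3.2, "Glu": -3.5,
    "Gln": -3.5, "Asp": -3.5, "Asn": -3.5, "Lys": -3.9, "Arg": -4.5,
}


def hydropathy_tier(original: str, replacement: str):
    """Classify a substitution by the difference in hydropathic index.

    Returns the narrowest tier that contains the difference, or None when
    the difference exceeds 2.
    """
    # Round to one decimal so floating-point noise cannot shift a tier.
    diff = round(abs(HYDROPATHY[original] - HYDROPATHY[replacement]), 1)
    if diff <= 0.5:
        return "within 0.5 (even more particularly preferred)"
    if diff <= 1.0:
        return "within 1 (particularly preferred)"
    if diff <= 2.0:
        return "within 2 (preferred)"
    return None


print(hydropathy_tier("Leu", "Ile"))  # difference of 0.7
print(hydropathy_tier("Ile", "Arg"))  # difference of 9.0
```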

[0090] It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101, incorporated herein by reference, states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e. with a biological property of the protein. It is understood that an amino acid can be substituted for another having a similar hydrophilicity value and still obtain a biologically equivalent protein.

[0091] As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (−0.4); proline (−0.5±1); alanine (−0.5); histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5); leucine (−1.8); isoleucine (−1.8); tyrosine (−2.3); phenylalanine (−2.5); tryptophan (−3.4).

[0092] In making changes based upon similar hydrophilicity values, the substitution of amino acids whose hydrophilicity values are within ±2 of the original value is preferred, those which are within ±1 of the original value are particularly preferred, and those within ±0.5 of the original value are even more particularly preferred.
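The “greatest local average hydrophilicity” referred to above can be illustrated by averaging the listed hydrophilicity values over a sliding window along a peptide. The Python sketch below makes several assumptions that are not specified herein: a window of six residues, the central values of +3.0 for aspartate and glutamate and −0.5 for proline, and a hypothetical toy peptide. The function name is likewise hypothetical.

```python
# Hydrophilicity values as listed above (one-letter amino acid codes).
# Central values are used where the text gives a +/- range (assumption).
HYDROPHILICITY = {
    "R": 3.0, "K": 3.0, "D": 3.0, "E": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "T": -0.4, "P": -0.5, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "L": -1.8, "I": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def most_hydrophilic_window(peptide: str, window: int = 6):
    """Return (start, segment, average) for the most hydrophilic window.

    The window length of 6 is an illustrative assumption. Ties are
    resolved in favor of the first window encountered.
    """
    if not 1 <= window <= len(peptide):
        raise ValueError("window must be between 1 and the peptide length")

    scores = [HYDROPHILICITY[residue] for residue in peptide.upper()]
    best_start, best_avg = 0, float("-inf")
    for start in range(len(scores) - window + 1):
        avg = sum(scores[start:start + window]) / window
        if avg > best_avg:
            best_start, best_avg = start, avg
    return best_start, peptide[best_start:best_start + window], best_avg


# Hypothetical toy peptide with a charged stretch flanked by hydrophobic residues.
print(most_hydrophilic_window("LLVVKDEKRDLLII"))
```

In the toy peptide the charged stretch scores highest, consistent with the stated correlation between local hydrophilicity and antigenicity.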

[0093] While discussion has focused on functionally equivalent polypeptides arising from amino acid changes, it will be appreciated that these changes can be effected by alteration of the encoding DNA, taking into consideration also that the genetic code is degenerate and that two or more codons can code for the same amino acid.

[0094] Thus, it will also be understood that this invention is not limited to the particular nucleic acid and amino acid sequences of SEQ ID NOs:1-8, 28, and 29. Recombinant vectors and isolated DNA segments can therefore variously include the platelet VDCC α₁ subunit polypeptide-encoding region itself, coding regions bearing selected alterations or modifications in the basic coding region, or larger polypeptides which nevertheless comprise platelet VDCC α₁ subunit polypeptide-encoding regions or can encode biologically functional equivalent proteins or peptides. Biological activity of a platelet VDCC α₁ subunit polypeptide can be determined, for example, by any of the assays disclosed herein below in Section G.2.

[0095] In particular embodiments, the invention concerns gene therapy methods that use isolated DNA segments and recombinant vectors incorporating DNA sequences which encode a protein comprising an amino acid sequence of SEQ ID NO:2 or 4. In other particular embodiments, the invention concerns isolated DNA sequences and recombinant DNA vectors incorporating DNA sequences which encode a protein comprising an amino acid sequence of a platelet VDCC α₁ subunit polypeptide protein from human or pig. In certain other embodiments, the invention concerns isolated DNA segments and recombinant vectors that comprise a nucleic acid sequence essentially as set forth in any of SEQ ID NOs:1, 3, 5-8, 28, and 29.

[0096] The nucleic acid segments of the present invention, regardless of the length of the coding sequence itself, can be combined with other DNA sequences, such as promoters, enhancers, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length can vary considerably. Thus, a nucleic acid fragment of almost any length can be employed, with the total length preferably being limited by the ease of preparation and use in the intended recombinant DNA protocol. For example, nucleic acid fragments can be prepared which include a short stretch complementary to a nucleic acid sequence set forth in any of SEQ ID NOs:1, 3, 5-8, 28, and 29, such as about 10 nucleotides, and which are up to 10,000 or 5,000 base pairs in length, with segments of 3,000 being preferred in certain cases. DNA segments with total lengths of about 4,000, 3,000, 2,000, 1,000, 500, 200, 100, and about 50 base pairs in length are also useful.

[0097] The DNA segments of the present invention encompass biologically functional equivalent platelet VDCC α₁ subunit polypeptides. Such sequences can arise as a consequence of codon redundancy and functional equivalency that are known to occur naturally within nucleic acid sequences and the proteins thus encoded. Alternatively, functionally equivalent proteins or peptides can be created via the application of recombinant DNA technology, in which changes in the protein structure can be engineered, based on considerations of the properties of the amino acids being exchanged. Changes can be introduced through the application of site-directed mutagenesis techniques, e.g., to introduce improvements to the antigenicity of the protein or to test platelet VDCC α₁ subunit mutants in order to examine activity in the modulation of calcium transport, or other activity at the molecular level. Site-directed mutagenesis techniques are known to those of skill in the art and are disclosed herein.

[0098] The invention further encompasses fusion proteins and peptides wherein the platelet VDCC α₁ subunit polypeptide coding region is aligned within the same expression unit with other proteins or peptides having desired functions, such as for purification or immunodetection purposes.

[0099] Recombinant vectors form important further aspects of the present invention. Particularly useful vectors are those in which the coding portion of the DNA segment is positioned under the control of a promoter. The promoter can be that naturally associated with the platelet VDCC α₁ subunit polypeptide gene, as can be obtained by isolating the 5′ non-coding sequences located upstream of the coding segment or exon, for example, using recombinant cloning and/or PCR technology and/or other methods known in the art, in conjunction with the compositions disclosed herein.

[0100] In other embodiments, certain advantages will be gained by positioning the coding DNA segment under the control of a recombinant, or heterologous, promoter. As used herein, a recombinant or heterologous promoter is a promoter that is not normally associated with a platelet VDCC α₁ subunit polypeptide gene in its natural environment. Such promoters can include promoters isolated from bacterial, viral, eukaryotic, or mammalian cells. Representative heterologous promoters can thus comprise PF4, α_(IIb/β3) (GPIIb/IIIa), P-selectin, or GPIb promoter, which have been disclosed in the art. Naturally, it will be important to employ a promoter that effectively directs the expression of the DNA segment in the cell type chosen for expression. The use of promoter and cell type combinations for protein expression is generally known to those of skill in the art of molecular biology (for example, see Sambrook et al., 1989, specifically incorporated herein by reference). The promoters employed can be constitutive or inducible and can be used under the appropriate conditions to direct high level expression of the introduced DNA segment, such as is advantageous in the large-scale production of recombinant proteins or peptides. Appropriate promoter systems provided for use in high-level expression include, but are not limited to, the vaccinia virus promoter and the baculovirus promoter.

[0101] In an alternative embodiment, the present invention provides an expression vector comprising a polynucleotide that encodes a biologically active platelet VDCC α₁ subunit polypeptide in accordance with the present invention. Also preferably, an expression vector of the present invention comprises a polynucleotide that encodes human or pig platelet VDCC α₁ subunit polypeptide gene product. More preferably, an expression vector of the present invention comprises a polynucleotide that encodes a polypeptide comprising an amino acid residue sequence of SEQ ID NO:2 or 4. More preferably, an expression vector of the present invention comprises a polynucleotide comprising the nucleotide sequence of any of SEQ ID NOs:1, 3, 5-8, 28, and 29. Even more preferably, an expression vector of the invention comprises a polynucleotide operatively linked to an enhancer-promoter. More preferably still, an expression vector of the invention comprises a polynucleotide operatively linked to a prokaryotic promoter. Alternatively, an expression vector of the present invention comprises a polynucleotide operatively linked to an enhancer-promoter that is a eukaryotic promoter and the expression vector further comprises a polyadenylation signal that is positioned 3′ of the carboxy-terminal amino acid and within a transcriptional unit of the encoded polypeptide. The expression vector can further comprise an intronic sequence from a platelet VDCC α₁ subunit gene, another platelet or megakaryocytic intronic sequence, or any other intronic sequence, as it has been shown that the inclusion of an intronic sequence into an expression vector can increase levels of expression.

[0102] In yet another embodiment, the present invention provides a recombinant host cell transfected, infected or adsorbed with a polynucleotide that encodes a biologically active platelet VDCC α₁ subunit polypeptide in accordance with the present invention. SEQ ID NOs:1-8, 28, and 29 set forth nucleotide and amino acid sequences from representative vertebrates, human and pig. Also provided by the present invention are homologous or biologically functionally equivalent polynucleotides and platelet VDCC α₁ subunit polypeptides found in other vertebrates, including particularly dog and bovine homologues. Preferably, a recombinant host cell of the present invention is transfected with the polynucleotide that encodes human or pig platelet VDCC α₁ subunit polypeptide. More preferably, a recombinant host cell of the present invention is transfected with the polynucleotide sequence encoding or set forth in any of SEQ ID NOs:1-8, 28, and 29. Even more preferably, a recombinant host cell is a mammalian cell. Most preferably, a recombinant host cell is a platelet or a megakaryocyte.

[0103] In another aspect, a recombinant host cell of the present invention is a prokaryotic host cell, including parasitic and bacterial cells. Preferably, a recombinant host cell of the invention is a bacterial cell, preferably a strain of Escherichia coli. More preferably, a recombinant host cell comprises a polynucleotide under the transcriptional control of regulatory signals functional in the recombinant host cell, wherein the regulatory signals appropriately control expression of the platelet VDCC α₁ subunit polypeptide in a manner to enable all necessary transcriptional and post-transcriptional modification.

[0104] In yet another embodiment, the present invention provides a method of preparing a platelet VDCC α₁ subunit polypeptide comprising transfecting a cell with a polynucleotide that encodes a biologically active platelet VDCC α₁ subunit polypeptide in accordance with the present invention, to produce a transformed host cell, and maintaining the transformed host cell under biological conditions sufficient for expression of the polypeptide. The polypeptide can be isolated if desired, using any suitable technique. The host cell can be a prokaryotic or eukaryotic cell. Preferably, the prokaryotic cell is a bacterial cell of Escherichia coli. More preferably, a polynucleotide transfected into the transformed cell comprises the nucleotide base sequence of any of SEQ ID NOs:1, 3, 5-8, 28, and 29. SEQ ID NOs:1-8, 28, and 29 set forth nucleotide and amino acid sequences for representative vertebrates, human and pig. Also provided by the present invention are homologues or biologically equivalent platelet VDCC α₁ subunit polypeptide polynucleotides and polypeptides found in other vertebrates, particularly warm blooded vertebrates, more particularly mammals, and even more particularly bovine and rat homologues.

[0105] As mentioned above, in connection with expression embodiments to prepare recombinant platelet VDCC α₁ subunit polypeptide proteins and peptides, it is contemplated that longer DNA segments will most often be used, with DNA segments encoding the entire platelet VDCC α₁ subunit polypeptide protein, functional domains or cleavage products thereof, being most preferred. However, it will be appreciated that the use of shorter DNA segments to direct the expression of platelet VDCC α₁ subunit polypeptides or core regions, such as can be used to generate anti-platelet VDCC α₁ subunit polypeptide antibodies, also falls within the scope of the invention.

[0106] DNA segments which encode peptide antigens from about 15 to about 50 amino acids in length, or more preferably, from about 15 to about 30 amino acids in length are contemplated to be particularly useful. DNA segments encoding peptides will generally have a minimum coding length on the order of about 45 to about 150, or to about 90 nucleotides. DNA segments encoding full length proteins can have a minimum coding length on the order of about 4,000 or 5,000 nucleotides for a protein in accordance with any of SEQ ID NOs:1, 3, 5-8, 28, and 29. DNA segments of the present invention can contain 300, 400, 500, 1,000, 1,500, 2,000, 2,500, 3,000, 3,500, 4,000, 4,500, or up to 5,000 nucleotides. Peptides of the present invention can contain 10, 20, 50, 100, 200, 300, 400, 500, 750, 1,000, or up to 1,500 amino acids.
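
The relationship between peptide length and minimum coding length stated above follows from the triplet nature of the genetic code (three nucleotides per amino acid). The short Python sketch below works through that arithmetic; the function name and the optional stop-codon parameter are illustrative assumptions and are not part of this disclosure.

```python
def min_coding_length(num_amino_acids, include_stop=False):
    """Return the minimum number of nucleotides needed to encode a peptide.

    Each amino acid is encoded by one three-nucleotide codon. Setting
    include_stop=True adds one further codon for a stop signal.
    """
    codons = num_amino_acids + (1 if include_stop else 0)
    return 3 * codons


for n in (15, 30, 50):
    print(n, "amino acids ->", min_coding_length(n), "nucleotides")
```

For peptides of 15, 30, and 50 amino acids this gives 45, 90, and 150 nucleotides, respectively, matching the ranges recited above.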

[0107] B.3. Sequence Modification Techniques

[0108] Modifications to the platelet VDCC α₁ subunit proteins and peptides described herein can be carried out using techniques known in the art, including site directed mutagenesis. Site-specific mutagenesis is a technique useful in the preparation of individual peptides, or biologically functional equivalent proteins or peptides, through specific mutagenesis of the underlying DNA. The technique further provides a ready ability to prepare and test sequence variants; for example, incorporating one or more of the foregoing considerations, by introducing one or more nucleotide sequence changes into the DNA. Site-specific mutagenesis allows the production of mutants through the use of specific oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. Typically, a primer of about 17 to 30 nucleotides in length is preferred, with about 5 to 10 residues on both sides of the junction of the sequence being altered.

[0109] In general, the technique of site-specific mutagenesis is well known in the art as exemplified by publications (e.g., Adelman et al., (1983) DNA 2:183; Sambrook et al., 1989) and can be achieved in a variety of ways generally known to those of skill in the art.
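
By way of a minimal sketch only, the primer dimensions discussed above (about 17 to 30 nucleotides overall, with unchanged template sequence flanking the altered position on both sides) can be illustrated in Python for the simple case of a single-base substitution. The function name, the 40-nucleotide template, and the chosen position are hypothetical and are not derived from any sequence of this disclosure; a practical design would also weigh factors such as melting temperature and secondary structure, which are not modeled here.

```python
def mutagenic_primer(template, position, new_base, flank=10):
    """Build a primer carrying a single-base substitution at a 0-based position.

    `flank` unchanged template bases are kept on each side of the altered base,
    so the primer length is 2 * flank + 1.
    """
    if len(new_base) != 1 or new_base not in "ACGT":
        raise ValueError("new_base must be one of A, C, G, T")
    if template[position] == new_base:
        raise ValueError("new_base is identical to the template base")
    if position - flank < 0 or position + flank + 1 > len(template):
        raise ValueError("not enough flanking sequence on one side")
    left = template[position - flank:position]
    right = template[position + 1:position + 1 + flank]
    return left + new_base + right


# Hypothetical 40-nt template used only for illustration.
template = "ATGGCTAGCTTGACCGATCGTACGGATCCATTGCAGGTCA"
primer = mutagenic_primer(template, position=20, new_base="A", flank=10)
print(primer, len(primer))
```

With ten flanking bases on each side the resulting primer is 21 nucleotides long, which falls within the preferred range recited above.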

[0110] B.4. Other Structural Equivalents

[0111] The knowledge of the structure of the platelet VDCC α₁ subunit polypeptide of the present invention provides a tool for investigating the mechanism of action of these proteins in a subject. For example, binding of these proteins to various substrate molecules can be predicted by various computer models. Upon discovering that such binding in fact takes place, knowledge of the protein structure then allows design and synthesis of small molecules which mimic the functional binding of the platelet VDCC α₁ subunit polypeptide to the substrate. This is the method of “rational” drug design, also described below.

[0112] Use of the isolated and purified platelet VDCC α₁ subunit polypeptide of the present invention in rational drug design is thus provided in accordance with the present invention. Additional rational drug design techniques are described in U.S. Pat. Nos. 5,834,228 and 5,872,011, herein incorporated in their entirety.

[0113] Thus, in addition to the peptidyl compounds described herein, other sterically similar compounds can be formulated to mimic the key portions of the peptide structure. Such compounds can be used in the same manner as the peptides of the invention and hence are also functional equivalents. The generation of a structural functional equivalent can be achieved by the techniques of modeling and chemical design known to those of skill in the art. It will be understood that all such sterically similar constructs fall within the scope of the present invention.

[0114] C. Introduction of Gene Products

[0115] In accordance with the present invention, where a platelet VDCC α₁ subunit gene itself is employed to introduce a platelet VDCC α₁ subunit gene product, a convenient method of introduction will be through the use of a recombinant vector that incorporates the desired gene, together with its associated control sequences. The preparation of recombinant vectors is well known to those of skill in the art and described in many references, such as, for example, Sambrook et al. (1989), incorporated herein in its entirety.

[0116] C.1. Vector Construction

[0117] It is understood that the DNA coding sequences to be expressed, in this case those encoding the platelet VDCC α₁ subunit gene products, are positioned in a vector adjacent to and under the control of a promoter. It is understood in the art that to bring a coding sequence under the control of such a promoter, one generally positions the 5′ end of the transcription initiation site of the transcriptional reading frame of the gene product to be expressed between about 1 and about 50 nucleotides “downstream” of (i.e., 3′ of) the chosen promoter. One can also desire to incorporate into the transcriptional unit of the vector an appropriate polyadenylation site (e.g., 5′-AATAAA-3′), if one was not contained within the original inserted DNA. Typically, these poly-A addition sites are placed about 30 to 2000 nucleotides “downstream” of the coding sequence at a position prior to transcription termination.
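
As an illustrative sketch, a candidate construct can be checked in silico for a 5′-AATAAA-3′ polyadenylation signal lying within the approximately 30 to 2000 nucleotide window downstream of the coding sequence mentioned above. The function name, its parameters, and the toy construct below are hypothetical assumptions introduced only for this example.

```python
POLYA_SIGNAL = "AATAAA"


def find_polya_signals(construct, cds_end, min_offset=30, max_offset=2000):
    """Return offsets (nucleotides downstream of the coding sequence end) of
    AATAAA motifs that fall inside the [min_offset, max_offset] window.

    `cds_end` is the 0-based index of the first base after the coding sequence.
    """
    offsets = []
    start = construct.find(POLYA_SIGNAL, cds_end)
    while start != -1:
        offset = start - cds_end
        if min_offset <= offset <= max_offset:
            offsets.append(offset)
        start = construct.find(POLYA_SIGNAL, start + 1)
    return offsets


# Hypothetical construct: a toy coding sequence, a 40-nt spacer, then the signal.
cds = "ATGGCTGGCAAGTGA"
spacer = "GC" * 20
construct = cds + spacer + POLYA_SIGNAL + "GCGC"
print(find_polya_signals(construct, cds_end=len(cds)))
```

In this toy construct the signal lies 40 nucleotides downstream of the coding sequence and is therefore reported; a signal placed only 10 nucleotides downstream would fall outside the window and be omitted.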

[0118] While use of the control sequences of the specific gene will be preferred, other control sequences can be employed, so long as they are compatible with the genotype of the cell being treated. Thus, one can mention other useful promoters by way of example, including, e.g., an SV40 early promoter, a long terminal repeat promoter from retrovirus, an actin promoter, a heat shock promoter, a metallothionein promoter, and the like. Representative platelet specific promoters include but are not limited to PF4, α_(IIb) (GPIIb), P-selectin or GPIb.

[0119] As is known in the art, a promoter is a region of a DNA molecule typically within about 100 nucleotide pairs upstream of (i.e., 5′ to) the point at which transcription begins (i.e., a transcription start site). That region typically contains several types of DNA sequence elements that are located in similar relative positions in different genes.

[0120] Another type of discrete transcription regulatory sequence element is an enhancer. An enhancer imposes specificity of time, location and expression level on a particular coding region or gene. A major function of an enhancer is to increase the level of transcription of a coding sequence in a cell that contains one or more transcription factors that bind to that enhancer. An enhancer can function when located at variable distances from transcription start sites so long as a promoter is present.

[0121] As used herein, the phrase “enhancer-promoter” means a composite unit that contains both enhancer and promoter elements. An enhancer-promoter is operatively linked to a coding sequence that encodes at least one gene product. As used herein, the phrase “operatively linked” means that an enhancer-promoter is connected to a coding sequence in such a way that the transcription of that coding sequence is controlled and regulated by that enhancer-promoter. Means for operatively linking an enhancer-promoter to a coding sequence are well known in the art; the precise orientation and location relative to a coding sequence of interest is dependent, inter alia, upon the specific nature of the enhancer-promoter.

[0122] An enhancer-promoter used in a vector construct of the present invention can be any enhancer-promoter that drives expression in a cell to be transfected. By employing an enhancer-promoter with well-known properties, the level and pattern of gene product expression can be optimized. The vector can further comprise an intronic sequence from a platelet VDCC α₁ subunit gene, another platelet or megakaryocytic intronic sequence, or any other intronic sequence, as it has been shown that the inclusion of an intronic sequence into a vector can increase levels of expression, particularly in the case of a gene therapy vector.

[0123] For introduction of, for example, a human platelet VDCC α₁ subunit gene, a vector construct that will deliver the gene to the affected cells is desired. Viral vectors can be used. These vectors will preferably be an adenoviral, a retroviral, a vaccinia viral vector, adeno-associated virus or Lentivirus; these vectors are preferred because they have been successfully used to deliver desired sequences to cells and tend to have a high infection efficiency. Suitable vector-platelet VDCC α₁ subunit gene constructs are adapted for administration as pharmaceutical compositions, as described herein below. Viral promoters can also be of use in vectors of the present invention, and are known in the art.

[0124] Commonly used viral promoters for expression vectors are derived from polyoma, cytomegalovirus, Adenovirus 2, and Simian Virus 40 (SV40). The early and late promoters of SV40 virus are particularly useful because both are obtained easily from the virus as a fragment that also contains the SV40 viral origin of replication. Smaller or larger SV40 fragments can also be used, provided there is included the approximately 250 bp sequence extending from the Hind III site toward the Bgl I site located in the viral origin of replication. Further, it is also possible, and often desirable, to utilize promoter or control sequences normally associated with the desired gene sequence, provided such control sequences are compatible with the host cell systems.

[0125] The origin of replication can be provided either by construction of the vector to include an exogenous origin, such as can be derived from SV40 or other viral source, or can be provided by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell chromosome, the latter is often sufficient.

[0126] Where a platelet VDCC α₁ subunit gene itself is employed it will be most convenient to simply use a wild type platelet VDCC α₁ subunit gene directly. However, certain regions of a platelet VDCC α₁ subunit gene can be employed exclusively without employing an entire wild type platelet VDCC α₁ subunit gene. It is proposed that it will ultimately be preferable to employ the smallest region needed to modulate biological activity so that one is not introducing unnecessary DNA into cells which receive a platelet VDCC α₁ subunit gene construct. The biological activity of these regions can easily be determined by the assays reported herein.

[0127] C.2. Genetically Modified Cell Lines and Animals

[0128] It is also within the scope of the present invention to prepare a genetically modified cell line (e.g., platelet or megakaryocyte) and/or a genetically modified animal (e.g., a transgenic non-human animal) that expresses a platelet VDCC α₁ subunit gene of the present invention, that does not express a platelet VDCC α₁ subunit gene, or that has a modified expression of a platelet VDCC α₁ subunit gene of the present invention. Preferred transgenic animals include but are not limited to mice, pigs and dogs.

[0129] Techniques for the preparation of genetically modified cell lines are disclosed herein above, and are generally known in the art. Modified megakaryocyte and platelet cell lines have utility in the study of the biological activity and in the preparation and development of laboratory and medical applications for platelets, including enhanced ability to store or otherwise manipulate platelets.

[0130] Techniques for the preparation of transgenic animals are known in the art. Exemplary techniques are described in U.S. Pat. No. 5,489,742 (transgenic rats); U.S. Pat. Nos. 4,736,866, 5,550,316, 5,614,396, 5,625,125 and 5,648,061 (transgenic mice); U.S. Pat. No. 5,573,933 (transgenic pigs); U.S. Pat. No. 5,162,215 (transgenic avian species) and U.S. Pat. No. 5,741,957 (transgenic bovine species), the entire contents of each of which are herein incorporated by reference.

[0131] With respect to a representative method for the preparation of a transgenic pig, cloned recombinant or synthetic DNA sequences or DNA segments encoding a platelet VDCC α₁ subunit polypeptide gene product are injected into fertilized eggs. The injected eggs are implanted in pseudo pregnant females and are grown to term to provide transgenic pigs whose cells express a platelet VDCC α₁ subunit polypeptide gene product.

[0132] Additionally, a genetically modified animal of the present invention can comprise a pig with targeted modification of the platelet VDCC α₁ subunit polypeptide gene. Pig strains with complete or partial functional inactivation of a platelet VDCC α₁ subunit polypeptide gene in megakaryocytes and/or in platelets are generated using standard techniques of site-specific recombination in embryonic stem cells. Capecchi, M. R. (1989) Science 244(4910):1288-92; Thomas, K. R., and Capecchi, M. R. (1990) Nature 346(6287):847-50; Delpire, E., et al. (1999) Nat Genet 22(2):192-5. Procedures analogous to those employed in the generation of a “knock-out” animal can be applied in the generation of a “knock-out” cell line.

[0133] Alternatives include the use of anti-sense or ribozyme VDCC constructs, driven by a universal or tissue-specific promoter, to reduce levels of a platelet VDCC α₁ subunit polypeptide in platelets or megakaryocytes, thus achieving a “knock-down” of individual isoforms (Luyckx, V. A., et al. (1999) Proc Natl Acad Sci U S A 96(21):12174-9). The invention also provides the generation of animal strains with conditional or inducible inactivation of individual or multiple VDCC genes (Sauer, B. (1998) Methods 14(4):381-92). For example, pigs are created which lack expression of any platelet VDCC α₁ subunit polypeptide in platelets or megakaryocytes through the sequential mating of pig strains with lox-P-flanked VDCC genes with a transgenic line expressing Cre-recombinase in platelets or megakaryocytes, using a platelet specific promoter, such as PF4, α_(IIb) (GPIIb), P-selectin or GPIb (Ding, Y., et al. (1997) J Biol Chem 272(44):28142-8).

[0134] The present invention also provides animal strains with specific “knocked-in” modifications in a platelet VDCC α₁ subunit polypeptide gene. This includes animals with genetically (Forlino, A., et al. (1999) J Biol Chem 274(53):37923-31) and functionally (Kissel, H., et al. (2000) Embo J 19(6):1312-1326) relevant point mutations in the VDCC genes, in addition to manipulations such as the insertion of disease-specific repeat expansions (White, J. K., et al. (1997) Nat Genet 17(4):404-10).

[0135] D. Generation of Antibodies

[0136] In still another embodiment, the present invention provides an antibody that specifically binds a polypeptide of the present invention. Preferably, an antibody of the invention is a monoclonal antibody. Techniques for preparing and characterizing antibodies are well known in the art (See e.g., Antibodies A Laboratory Manual, E. Harlow and D. Lane, Cold Spring Harbor Laboratory, 1988).

[0137] The phrase “specifically (or selectively) binds to an antibody”, or “specifically (or selectively) immunoreactive with”, when referring to a protein or peptide, refers to a binding reaction which is determinative of the presence of the protein in a heterogeneous population of proteins and other biological materials. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not show significant binding to other proteins present in the sample. Specific binding to an antibody under such conditions can require an antibody that is selected for its specificity for a particular protein. For example, antibodies raised to a protein with an amino acid sequence encoded by any of the nucleic acid sequences of the invention can be selected to obtain antibodies specifically immunoreactive with that protein and not with other proteins.

[0138] Briefly, a polyclonal antibody is prepared by immunizing an animal with an immunogen comprising a polypeptide or polynucleotide of the present invention, and collecting antisera from that immunized animal. A wide range of animal species can be used for the production of antisera. Because of the relatively large blood volume of rabbits, a rabbit is a preferred choice for production of polyclonal antibodies.

[0139] As is well known in the art, a given polypeptide or polynucleotide can vary in its immunogenicity. It is often necessary therefore to couple the immunogen (e.g., a polypeptide or polynucleotide of the present invention) with a carrier. Exemplary and preferred carriers are keyhole limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins such as ovalbumin, mouse serum albumin or rabbit serum albumin can also be used as carriers.

[0140] Reagents for conjugating a polypeptide or a polynucleotide to a carrier protein are well known in the art and include glutaraldehyde, m-maleimidobenzoyl-N-hydroxysuccinimide ester, carbodiimide and bis-diazotized benzidine.

[0141] As is also well known in the art, immunogenicity to a particular immunogen can be enhanced by the use of non-specific stimulators of the immune response known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant, incomplete Freund's adjuvant and aluminum hydroxide adjuvant.

[0142] The amount of immunogen used for the production of polyclonal antibodies varies depending, inter alia, upon the nature of the immunogen as well as the animal used for immunization. A variety of routes can be used to administer the immunogen, e.g., subcutaneous, intramuscular, intradermal, intravenous and intraperitoneal. The production of polyclonal antibodies is monitored by sampling blood of the immunized animal at various points following immunization. When a desired level of immunogenicity is obtained, the immunized animal can be bled and the serum isolated and stored.

[0143] In another aspect, the present invention provides a method of producing an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide, the method comprising: (a) transfecting recombinant host cells with a polynucleotide that encodes that polypeptide; (b) culturing the host cells under conditions sufficient for expression of the polypeptide; (c) recovering the polypeptide; and (d) preparing antibodies to the polypeptide. Preferably, the platelet VDCC α₁ subunit polypeptide is capable of modulating calcium levels within or outside of cells in accordance with the present invention.

[0144] A monoclonal antibody of the present invention can be readily prepared through use of well-known techniques such as the hybridoma techniques exemplified in U.S. Pat. No. 4,196,265 and the phage-display techniques disclosed in U.S. Pat. No. 5,260,203, the contents of which are herein incorporated by reference.

[0145] A typical technique involves first immunizing a suitable animal with a selected antigen (e.g., a polypeptide or polynucleotide of the present invention) in a manner sufficient to provide an immune response. Rodents such as mice and rats are preferred animals. Spleen cells from the immunized animal are then fused with cells of an immortal myeloma cell line. Where the immunized animal is a mouse, a preferred myeloma cell is a murine NS-1 myeloma cell.

[0146] The fused spleen/myeloma cells are cultured in a selective medium to select fused spleen/myeloma cells from the parental cells. Fused cells are separated from the mixture of non-fused parental cells, for example, by the addition of agents that block the de novo synthesis of nucleotides in the tissue culture media. This culturing provides a population of hybridomas from which specific hybridomas are selected. Typically, selection of hybridomas is performed by culturing the cells by single-clone dilution in microtiter plates, followed by testing the individual clonal supernatants for reactivity with the antigen/polypeptide. The selected clones can then be propagated indefinitely to provide the monoclonal antibody.

[0147] By way of specific example, to produce an antibody of the present invention, mice are injected intraperitoneally with between about 1-200 μg of an antigen comprising a polypeptide of the present invention. B lymphocyte cells are stimulated to grow by injecting the antigen in association with an adjuvant such as complete Freund's adjuvant (a non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis). At some time (e.g., at least two weeks) after the first injection, mice are boosted by injection with a second dose of the antigen mixed with incomplete Freund's adjuvant.

[0148] A few weeks after the second injection, mice are tail bled and the sera titered by immunoprecipitation against radiolabeled antigen. Preferably, the method of boosting and titering is repeated until a suitable titer is achieved. The spleen of the mouse with the highest titer is removed and the spleen lymphocytes are obtained by homogenizing the spleen with a syringe.

[0149] Mutant lymphocyte cells known as myeloma cells are obtained from laboratory animals in which such cells have been induced to grow by a variety of well-known methods. Myeloma cells lack the salvage pathway of nucleotide biosynthesis. Because myeloma cells are tumor cells, they can be propagated indefinitely in tissue culture, and are thus “immortal”. Numerous cultured cell lines of myeloma cells from mice and rats, such as murine NS-1 myeloma cells, have been established.

[0150] Myeloma cells are combined under conditions appropriate to foster fusion with the normal antibody-producing cells from the spleen of the mouse or rat injected with the antigen/polypeptide of the present invention. Fusion conditions include, for example, the presence of polyethylene glycol. The resulting fused cells are hybridoma cells. Like myeloma cells, hybridoma cells grow indefinitely in culture.

[0151] Hybridoma cells are separated from unfused myeloma cells by culturing in a selection medium such as HAT media (hypoxanthine, aminopterin, and thymidine). Unfused myeloma cells lack the enzymes necessary to synthesize nucleotides via the salvage pathway and are therefore killed in the presence of aminopterin, methotrexate, or azaserine, which block de novo nucleotide synthesis. Unfused lymphocytes also do not continue to grow in tissue culture. Thus, only cells that have successfully fused (hybridoma cells) can grow in the selection media.
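
The selection logic described above can be summarized, as a deliberately simplified sketch, by treating each cell type as having two properties: whether it retains the salvage pathway and whether it can be propagated indefinitely in culture. The class and function names below are illustrative assumptions; the model ignores all other aspects of cell biology.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    name: str
    has_salvage_pathway: bool  # can use hypoxanthine/thymidine when de novo synthesis is blocked
    immortal: bool             # can be propagated indefinitely in tissue culture


def survives_hat(cell):
    """Aminopterin blocks de novo synthesis, so growth requires the salvage pathway;
    continued growth in culture additionally requires immortality."""
    return cell.has_salvage_pathway and cell.immortal


cells = [
    Cell("unfused myeloma", has_salvage_pathway=False, immortal=True),
    Cell("unfused spleen lymphocyte", has_salvage_pathway=True, immortal=False),
    Cell("hybridoma", has_salvage_pathway=True, immortal=True),
]
for cell in cells:
    print(cell.name, "->", "grows" if survives_hat(cell) else "does not grow")
```

Under these assumptions only the hybridoma, which combines the salvage pathway of the lymphocyte with the immortality of the myeloma cell, continues to grow in the selection medium.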

[0152] Each of the surviving hybridoma cells produces a single antibody. These cells are then screened for the production of the specific antibody that specifically binds an antigen/polypeptide of the present invention. Single cell hybridomas are isolated by limiting dilutions of the hybridomas. The hybridomas are serially diluted many times and, after the dilutions are allowed to grow, the supernatant is tested for the presence of the monoclonal antibody. The clones producing that antibody are then cultured in large amounts to produce an antibody of the present invention in convenient quantity.

[0153] By use of a monoclonal antibody of the present invention, specific polypeptides and polynucleotides of the invention can be recognized as antigens, and thus identified. Once identified, those polypeptides and polynucleotides can be isolated and purified by techniques such as antibody-affinity chromatography. In antibody-affinity chromatography, a monoclonal antibody is bound to a solid substrate and exposed to a solution containing the desired antigen. The antigen is removed from the solution through an immunospecific reaction with the bound antibody. The polypeptide or polynucleotide is then easily removed from the substrate and purified.

[0154] E. Detecting a Polynucleotide or a Polypeptide of the Present Invention

[0155] Alternatively, the present invention provides a method of detecting a polypeptide of the present invention, wherein the method comprises immunoreacting the polypeptides with antibodies prepared according to the method described above to form antibody-polypeptide conjugates, and detecting the conjugates.

[0156] In yet another embodiment, the present invention provides a method of detecting messenger RNA transcripts that encode a polypeptide of the present invention, wherein the method comprises hybridizing the messenger RNA transcripts with polynucleotide sequences that encode the polypeptide to form duplexes; and detecting the duplexes. Alternatively, the present invention provides a method of detecting DNA molecules that encode a polypeptide of the present invention, wherein the method comprises hybridizing DNA molecules with a polynucleotide that encodes that polypeptide to form duplexes; and detecting the duplexes.
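
Duplex formation of the kind described above depends on base complementarity between the probe and its target. The following Python sketch, offered only as an illustration, computes the reverse complement of a probe and locates positions in a target strand where a perfectly complementary duplex could form. The sequences are hypothetical, the DNA alphabet is used throughout for simplicity, and exact matching is an assumption of the sketch; actual hybridization tolerates some mismatches depending on the stringency of the conditions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def probe_binding_sites(target, probe):
    """Return 0-based start positions in `target` where `probe` could form a
    perfectly complementary duplex (exact matching only)."""
    site = reverse_complement(probe)
    return [i for i in range(len(target) - len(site) + 1)
            if target[i:i + len(site)] == site]


# Hypothetical target strand and probe; neither is taken from this disclosure.
target = "GGATCCATGGCTAGCTTGACCGAATTC"
probe = reverse_complement("ATGGCTAGCTTGACC")
print(probe)
print(probe_binding_sites(target, probe))
```

Here the probe is complementary to a single 15-nucleotide stretch of the target beginning at position 6, so one binding site is reported.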

[0157] The detection and screening assays disclosed herein can optionally be used as a prognostic tool and/or diagnostic aid. Platelet VDCC α₁ subunit polypeptides and nucleic acids can be readily used in a clinical setting as a prognostic and/or diagnostic indicator for screening for levels of expression of platelet VDCC α₁ subunit polypeptides, or alterations in native sequences. The nucleotide sequences of the subject invention can be used to detect differences in gene or gene product sequences between normal, carrier, or affected individuals. As discussed herein above, such differences can consist of single-nucleotide changes or multiple changes, deletions, or additions in the native sequence which result in altered transcription, translation, or biological activity or properties of the gene or gene product. These differences can be readily detected using the compositions of the present invention and techniques known in the art, including but not limited to SSCP analysis, RFLP analysis, or other PCR- or nucleotide-based analysis.

[0158] DNA segments of the invention or RNA having the sequence of, or a sequence complementary to, SEQ ID NOs:1, 3, 5-8, 28, and 29 can be used. Such polynucleic acids can comprise 10, 20, 40, 50, 70, 100, 250, 300, 400, 500, or 1,000 nucleotides or up to the full length of SEQ ID NOs:1, 3, 5-8, 28, and 29. Such polynucleic acids can, but need not, encode polypeptides which retain some or all of the biological activity of the native gene or gene product.

[0159] The present invention provides a method of screening a biological sample for the presence of a platelet VDCC α₁ subunit polypeptide. A biological sample to be screened can be a biological fluid such as extracellular or intracellular fluid, or a cell or tissue extract or homogenate. A biological sample can also be an isolated cell (e.g., in culture) or a collection of cells such as in a tissue sample or histology sample. A tissue sample can be suspended in a liquid medium or fixed onto a solid support such as a microscope slide.

[0160] In accordance with a screening assay method, a biological sample is exposed to an antibody that specifically binds the polypeptide whose presence is being assayed. Typically, exposure is accomplished by forming an admixture in a liquid medium that contains both the antibody and the candidate polypeptide. Either the antibody or the sample with the polypeptide can be affixed to a solid support (e.g., a column or a microtiter plate). Additional details of methods for such assays are known in the art. The presence of polypeptide in the sample is detected by evaluating the formation and presence of antibody-polypeptide conjugates. Techniques for detecting such antibody-antigen conjugates or complexes are well known in the art and include but are not limited to centrifugation, affinity chromatography and the like, and binding of a secondary antibody to the antibody-candidate receptor complex.

[0161] In one embodiment, detection is accomplished by detecting an indicator affixed to the antibody. Exemplary and well-known indicators include radioactive labels (e.g., ³²P, ¹²⁵I, ¹⁴C), a second antibody or an enzyme such as horseradish peroxidase. Techniques for affixing indicators to antibodies are known in the art.

[0162] In another aspect, the present invention provides a method of screening a biological sample for the presence of antibodies that specifically bind a platelet VDCC α₁ subunit polypeptide. Preferably the antibody so identified has activity in the modulation of platelet VDCC α₁ subunit polypeptide biological activity in accordance with the present invention. In accordance with such a method, a biological sample is exposed to a platelet VDCC α₁ subunit polypeptide under biological conditions and for a period of time sufficient for antibody-polypeptide conjugate formation and the formed conjugates are detected.

[0163] A DNA or RNA molecule and particularly a DNA segment or polynucleotide can be used for hybridization to a DNA or RNA source or sample suspected of encoding a platelet VDCC α₁ subunit polypeptide of the present invention; such molecules are referred to as “probes”, and such hybridization is “probing”. Such probes can be made synthetically. The probing is usually accomplished by hybridizing the oligonucleotide to a DNA source suspected of possessing a platelet VDCC α₁ subunit polypeptide gene. In some cases, the probes constitute only a single probe, and in others, the probes constitute a collection of probes based on a certain amino acid sequence or sequences of the polypeptide and account in their diversity for the redundancy inherent in the genetic code.
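By way of non-limiting illustration only, the following Python sketch shows how a collection of probes can account for the redundancy of the genetic code by enumerating every coding sequence for a short peptide. The function names and the example peptides are hypothetical and are not taken from the sequences disclosed herein; the codon table is the standard genetic code.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order for the first, second and third base.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons_for(amino_acid):
    """Return every DNA codon that encodes the given one-letter amino acid."""
    codons = [codon for codon, aa in CODON_TABLE.items() if aa == amino_acid]
    if not codons:
        raise ValueError("unknown amino acid: %r" % amino_acid)
    return codons


def degenerate_probes(peptide):
    """Return all coding-strand sequences that could encode the peptide."""
    choices = [codons_for(aa) for aa in peptide.upper()]
    return ["".join(combo) for combo in product(*choices)]


def degeneracy(peptide):
    """Number of distinct sequences in the probe collection for the peptide."""
    count = 1
    for aa in peptide.upper():
        count *= len(codons_for(aa))
    return count
```

For the hypothetical tripeptide Met-Trp-Lys the collection has two members, because only lysine has more than one codon. Peptide stretches rich in leucine, serine, or arginine (six codons each) enlarge the collection quickly, which is one reason a low-degeneracy stretch is usually chosen.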

[0164] Other molecules which are neither DNA nor RNA but are capable of hybridizing in a similar manner and which are designed structurally to mimic the DNA or RNA sequence of a platelet VDCC α₁ subunit polypeptide gene are also provided. Here, a suitable source to examine is capable of expressing a polypeptide of the present invention and can be a genomic library of a cell line of interest. Alternatively, a source of DNA or RNA can include total DNA or RNA from the cell line of interest. Once the hybridization method of the invention has identified a candidate DNA segment, a positive clone can be confirmed by further hybridization, restriction enzyme mapping, sequencing and/or expression and testing.

[0165] Alternatively, such DNA molecules can be used in a number of techniques including their use as: (1) diagnostic tools to detect normal and abnormal DNA sequences in DNA derived from patient's cells; (2) reagents for detecting and isolating other members of the polypeptide family and related polypeptides from a DNA library potentially containing such sequences; (3) primers for hybridizing to related sequences for the purpose of amplifying those sequences; (4) primers for altering native platelet VDCC α₁ subunit DNA sequences; as well as (5) other techniques which rely on the similarity of the sequences of interest to those of the sequences herein disclosed.

[0166] As set forth above, in certain aspects, DNA sequence information provided by the invention allows for the preparation of probes that specifically hybridize to encoding sequences of a selected platelet VDCC α₁ subunit gene. In these aspects, probes of an appropriate length are prepared based on a consideration of the encoding sequence for a polypeptide of this invention. The ability of such probes to specifically hybridize to other encoding sequences lends them particular utility in a variety of embodiments. Most importantly, the probes can be used in a variety of assays for detecting the presence of complementary sequences in a given sample. However, other uses are envisioned, including the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic constructions.

[0167] To provide certain of the advantages in accordance with the invention, a preferred nucleic acid sequence employed for hybridization studies or assays includes probe sequences that are complementary to or mimic at least a 14 to 40 or so long nucleotide stretch of a nucleic acid sequence of the present invention, such as a sequence shown in any of SEQ ID NOs:1, 3, 5-8, 28, and 29. A size of at least 14 nucleotides in length helps to ensure that the fragment is of sufficient length to form a duplex molecule that is both stable and selective. Molecules having complementary sequences over stretches greater than 14 bases in length are generally preferred, though, to increase stability and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid molecules obtained. One will generally prefer to design nucleic acid molecules having gene-complementary stretches of 14 to 20 nucleotides, or even longer where desired, such as 30, 40, 50, 60, 100, 200, 300, or 500 nucleotides or up to the full length of any of SEQ ID NOs:1, 3, 5-8, 28, and 29. Such fragments can be readily prepared by, for example, directly synthesizing the fragment by chemical synthesis, by application of nucleic acid amplification technology, such as the PCR technology of U.S. Pat. No. 4,683,202, herein incorporated by reference, or by introducing selected sequences into recombinant vectors for recombinant production.
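By way of non-limiting illustration only, the following Python sketch derives candidate probes that are complementary to successive stretches of a target sequence. The 14-nucleotide minimum reflects the length discussed above; the function names and the toy target in the usage note are hypothetical, and the sketch does not evaluate stability or selectivity of any candidate.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
MIN_PROBE_LENGTH = 14  # minimum stretch discussed in the text above


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def tile_probes(target, length=MIN_PROBE_LENGTH, step=1):
    """Return probes complementary to each window of `length` nt in `target`."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probe length should be at least 14 nucleotides")
    if step < 1:
        raise ValueError("step must be a positive integer")
    target = target.upper()
    return [
        reverse_complement(target[start:start + length])
        for start in range(0, len(target) - length + 1, step)
    ]
```

Calling `tile_probes` on a 20-nucleotide toy target with the default length yields seven overlapping 14-mers; a longer `length` yields fewer, longer candidates.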

[0168] Accordingly, a nucleotide sequence of the present invention can be used for its ability to selectively form duplex molecules with complementary stretches of the gene. Depending on the application envisioned, one employs varying conditions of hybridization to achieve varying degrees of selectivity of the probe toward the target sequence. For applications requiring a high degree of selectivity, one typically employs relatively stringent conditions to form the hybrids. For example, one selects relatively low salt and/or high temperature conditions, such as provided by 0.02M-0.15M salt (e.g. NaCl), including particularly 200 mM salt, at temperatures of 50° C. to 70° C., including particularly temperatures of about 55° C., about 60° C. and about 65° C. Such conditions are particularly selective, and tolerate little, if any, mismatch between the probe and the template or target strand.

[0169] Of course, for some applications, less stringent hybridization conditions are typically needed to allow formation of the heteroduplex; one of skill in the art will know how to adjust the hybridization conditions for optimizing particular procedures. For example, it is generally appreciated that conditions can be rendered more stringent by the addition of increasing amounts of formamide, which serves to destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization conditions can be readily manipulated by one of skill in the art using known methods to carry out the desired function or experiment, without undue experimentation.
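By way of non-limiting illustration only, the effect of salt concentration and formamide on duplex stability described above can be sketched with one widely quoted empirical approximation of the melting temperature of a DNA duplex. The constants below are an assumption: they differ somewhat between references, and the approximation is less reliable for short oligonucleotides. The function name and parameters are hypothetical.

```python
import math


def estimate_tm(gc_percent, length, sodium_molar, formamide_percent=0.0):
    """Approximate duplex melting temperature in degrees C.

    Empirical form (constants vary between references):
        Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 0.63*(%formamide) - 600/length
    """
    if length <= 0:
        raise ValueError("length must be positive")
    if sodium_molar <= 0:
        raise ValueError("sodium concentration must be positive")
    return (
        81.5
        + 16.6 * math.log10(sodium_molar)
        + 0.41 * gc_percent
        - 0.63 * formamide_percent
        - 600.0 / length
    )
```

Under this approximation, lowering the salt concentration or adding formamide lowers the estimated melting temperature, so a hybridization carried out at a fixed temperature becomes more stringent, consistent with the qualitative statements above.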

[0170] In another aspect, the present invention provides assay kits for detecting the presence of a polypeptide of the present invention in biological samples, where the kits comprise a first antibody capable of immunoreacting with the polypeptide. Preferably, the assay kits of the invention further comprise a second container containing a second antibody that immunoreacts with the first antibody. More preferably, the antibodies used in the assay kits of the present invention are monoclonal antibodies. Even more preferably, the first antibody is affixed to a solid support. More preferably still, the first and second antibodies comprise an indicator, and, preferably, the indicator is a radioactive label or an enzyme.

[0171] The present invention also provides an assay kit for screening agents. Such a kit can contain a polypeptide of the present invention. The kit can additionally contain reagents for detecting an interaction between an agent and a polypeptide of the present invention.

[0172] In an alternative aspect, the present invention provides assay kits for detecting the presence, in biological samples, of a polynucleotide that encodes a polypeptide of the present invention, the kits comprising a first container that contains a second polynucleotide identical or complementary to a segment of at least 10 contiguous nucleotide bases of, as a preferred example, any of SEQ ID NOs:1, 3, 5-8, 28, and 29. In another embodiment, the present invention provides assay kits for detecting the presence, in a biological sample, of antibodies that specifically binds a polypeptide of the present invention, the kits comprising a platelet VDCC α₁ subunit polypeptide that immunoreacts with the antibodies.

[0173] F. Mapping, Polynucleotide and Polypeptide Screening

[0174] In another embodiment of the invention, the nucleic acid sequences which encode a platelet VDCC α₁ subunit polypeptide can also be used to generate hybridization probes which are useful for mapping naturally occurring genomic sequences and/or disease loci. The sequences can be mapped to a particular chromosome or to a specific region of the chromosome using well-known techniques. Such techniques include FISH, FACS, or artificial chromosome constructions, such as yeast artificial chromosomes, bacterial artificial chromosomes, bacterial P1 constructions or single chromosome cDNA libraries as reviewed in Price, C. M. (1993) Blood Rev. 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154.

[0175] F.1. Mapping

[0176] FISH (as described in Verma et al. (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York, N.Y.) can be correlated with other physical chromosome mapping techniques and genetic map data. Examples of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation between the location of the gene encoding a platelet VDCC α₁ subunit polypeptide on a physical chromosomal map and a specific disease, or predisposition to a specific disease, can help delimit the region of DNA associated with that genetic disease. The nucleotide sequences of the subject invention can be used to detect differences in gene sequences between normal, carrier, or affected individuals.

[0177] In situ hybridization of chromosomal preparations and physical mapping techniques such as linkage analysis using established chromosomal markers can be used for extending genetic maps. Often the placement of a gene on the chromosome of another mammalian species, such as mouse, reveals associated markers also found in other mammals such as humans even if the number or arm of a particular human chromosome is not known. New sequences can be assigned to chromosomal arms, or parts thereof, by physical mapping. This provides valuable information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once the disease or syndrome has been crudely localized by genetic linkage to a particular genomic region, for example, ataxia-telangiectasia (A-T) to 11q22-23 (Gatti, R. A. et al. (1988) Nature 336:577-580), any sequences mapping to that area can represent associated or regulatory genes for further investigation. The nucleotide sequences of the present invention can thus also be used to detect differences in the chromosomal location due to translocation, inversion, etc. among normal, carrier, or affected individuals.

[0178] The mapping methods of the present invention also employ genomic clones of the exons of a platelet VDCC α₁ subunit gene. Sequences for a human platelet VDCC α₁ subunit polypeptide gene are set forth in SEQ ID NOs:1-6, 28, and 29. Thus, the present invention also provides genetic assays based on the genomic sequence of the human platelet VDCC α₁ subunit polypeptide genes. An intronic sequence flanking an individual exon encoding a platelet VDCC α₁ subunit polypeptide is employed in the design of oligonucleotide primers suitable for the mutation analysis of human genomic DNA. Thus, intronic primers can be used to screen for genetic variants by a number of PCR-based techniques, including single-strand conformation polymorphism (SSCP) analysis (Orita, M., et al. (1989) Proc Natl Acad Sci USA 86(8):2766-70), SSCP/heteroduplex analysis, enzyme mismatch cleavage, and direct sequence analysis of amplified exons (Kestila, M., et al. (1998) Mol Cell 1(4), 575-82; Yuan, B., et al. (1999) Hum Mutat 14(5):440-6). Similar techniques can be applied to putative 5′-regulatory regions, e.g. the putative promoters 5′ of a platelet VDCC α₁ subunit gene.

[0179] F.2. Polynucleotide Screening

[0180] Automated methods can also be applied to the large-scale characterization of single nucleotide polymorphisms (SNPs) (Brookes, A. J. (1999) Gene 234(2):177-186; Wang, D. G., et al. (1998) Science 280(5366):1077-82) within and near a platelet VDCC α₁ subunit gene. Once genetic variants have been detected in specific patient populations, the present invention provides assays to detect the mutation by methods such as allele-specific hybridization (Stoneking, M., et al. (1991) Am J Hum Genet 48(2):370-82), or restriction analysis of amplified genomic DNA containing the specific mutation. Again, these detection methods can be automated using existing technology (Wang, D. G., et al. (1998) Science 280(5366):1077-82). In the case of genetic disease or human phenotypes caused by repeat expansion (Lafreniere, R. G., et al. (1997) Nat Genet 15(3):298-302; Timchenko, L. T., and Caskey, C. T. (1996) Faseb J 10(14):1589-97), the invention provides an assay based on PCR of genomic DNA with oligonucleotide primers flanking the involved repeat.
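By way of non-limiting illustration only, the following Python sketch shows why primers flanking a repeat can reveal an expansion: the amplicon length grows with the number of repeat units, and the repeat count can be read from the amplified sequence. The repeat unit, flanking sequences, and function names are hypothetical and are not taken from the sequences disclosed herein.

```python
import re


def longest_tandem_run(amplicon, unit):
    """Return the largest number of back-to-back copies of `unit` in `amplicon`."""
    if not unit:
        raise ValueError("repeat unit must be non-empty")
    pattern = re.compile("(?:%s)+" % re.escape(unit.upper()))
    runs = [
        len(match.group(0)) // len(unit)
        for match in pattern.finditer(amplicon.upper())
    ]
    return max(runs, default=0)


def expected_amplicon_length(flank_5, flank_3, unit, copies):
    """Length of a PCR product spanning both flanks and `copies` repeat units."""
    if copies < 0:
        raise ValueError("copies must be non-negative")
    return len(flank_5) + len(unit) * copies + len(flank_3)
```

In practice the product length would be estimated from gel or capillary electrophoresis rather than from a known sequence; the sketch only makes explicit the arithmetic that links product size to repeat number.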

[0181] As used herein and in the claims, the term “polymorphism” refers to the occurrence of two or more genetically determined alternative sequences or alleles in a population. A polymorphic marker is the locus at which divergence occurs. Preferred markers have at least two alleles, each occurring at a frequency of greater than 1%. A polymorphic locus can be as small as one base pair.
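By way of non-limiting illustration only, the preference for markers having at least two alleles, each at a frequency greater than 1%, can be expressed as a simple check on observed allele counts. The function names and the example counts in the usage note are hypothetical.

```python
def allele_frequencies(counts):
    """Convert a mapping of allele -> observed count into allele -> frequency."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("at least one allele observation is required")
    return {allele: n / total for allele, n in counts.items()}


def is_preferred_marker(counts, min_frequency=0.01):
    """True if at least two alleles each occur at a frequency above the cutoff."""
    frequencies = allele_frequencies(counts)
    common = [a for a, f in frequencies.items() if f > min_frequency]
    return len(common) >= 2
```

For example, a locus observed as 980 copies of one allele and 20 of another (2%) meets the preference, whereas 995 and 5 (0.5%) does not.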

[0182] The provided nucleic acid molecules can be labeled according to any technique known in the art, such as with radiolabels, fluorescent labels, enzymatic labels, sequence tags, etc. Such molecules can be used as allele-specific oligonucleotide probes. Body samples can be tested to determine whether a platelet VDCC α₁ subunit gene contains a polymorphism. Suitable body samples for testing include those comprising DNA, RNA or protein obtained from biopsies, including bone marrow biopsies; or from blood.

[0183] In one embodiment of the invention two pairs of isolated oligonucleotide primers are provided. These sets of primers are optionally derived from a platelet VDCC α₁ subunit exon. The oligonucleotide primers are useful, for example, in detecting a polymorphism of a platelet VDCC α₁ subunit gene. The primers direct amplification of a target polynucleotide prior to sequencing. In another embodiment of the invention isolated allele specific oligonucleotides (ASO) are provided. The allele specific oligonucleotides are also useful in detecting a polymorphism of a platelet VDCC α₁ subunit gene.

[0184] The terms “substantially complementary to” or “substantially the sequence of” refer to sequences which hybridize to the sequences provided (e.g. SEQ ID NOs: 1, 3, 5-8, 28, and 29) under stringent conditions as disclosed herein and/or sequences having sufficient identity with any of SEQ ID NOs:1, 3, 5-8, 28, and 29, such that the allele specific oligonucleotides of the invention hybridize to the sequence. The term “isolated” as used herein includes oligonucleotides substantially free of other nucleic acids, proteins, lipids, carbohydrates or other materials with which they can be associated, such association being either in cellular material or in a synthesis medium. A “target polynucleotide” or “target nucleic acid” refers to the nucleic acid sequence of interest e.g., a platelet VDCC α₁ subunit-encoding polynucleotide. Other primers which can be used for primer hybridization are readily ascertainable to those of skill in the art based upon the disclosure herein.

[0185] The primers of the invention embrace oligonucleotides of sufficient length and appropriate sequence so as to provide initiation of polymerization on a significant number of nucleic acids in the polymorphic locus. Specifically, the term “primer” as used herein refers to a sequence comprising two or more deoxyribonucleotides or ribonucleotides, preferably more than three, and more preferably more than eight and most preferably at least about 20 nucleotides of a platelet VDCC α₁ subunit exonic or intronic region as are disclosed herein. Such oligonucleotides are preferably between ten and thirty bases in length. Such oligonucleotides can optionally further comprise a detectable label.

[0186] Environmental conditions conducive to synthesis include the presence of nucleoside triphosphates and an agent for polymerization, such as DNA polymerase, and a suitable temperature and pH. The primer is preferably single stranded for maximum efficiency in amplification, but can be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent for polymerization. The exact length of primer will depend on many factors, including temperature, buffer, and nucleotide composition. The oligonucleotide primer typically contains 12-20 or more nucleotides, although it can contain fewer nucleotides.

[0187] Primers of the invention are designed to be “substantially” complementary to each strand of the genomic locus to be amplified. This means that the primers must be sufficiently complementary to hybridize with their respective strands under conditions which allow the agent for polymerization to perform. In other words, the primers should have sufficient complementarity with the 5′ and 3′ sequences flanking the transition to hybridize therewith and permit amplification of the genomic locus.
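By way of non-limiting illustration only, the following Python sketch picks a primer pair from the sequences flanking a locus on a template written 5′ to 3′, and then checks which region the pair would delimit. It assumes perfectly complementary primers and a single binding site for each, which is a simplification of the “substantially complementary” primers described above. All names and the toy sequences in the usage note are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def flanking_primers(template, start, end, length=20):
    """Return (forward, reverse) primers flanking template[start:end].

    The forward primer matches the top strand immediately upstream of the
    locus; the reverse primer is complementary to the top strand immediately
    downstream, so that both are written 5' to 3'.
    """
    if start - length < 0 or end + length > len(template) or start > end:
        raise ValueError("template does not contain enough flanking sequence")
    template = template.upper()
    forward = template[start - length:start]
    reverse = reverse_complement(template[end:end + length])
    return forward, reverse


def delimited_region(template, forward, reverse):
    """Return the top-strand region bounded by the primer pair, or None."""
    template = template.upper()
    left = template.find(forward.upper())
    right = template.find(reverse_complement(reverse))
    if left < 0 or right < left:
        return None
    return template[left:right + len(reverse)]
```

With a toy template consisting of a 20-nucleotide upstream flank, an 8-nucleotide locus, and a 20-nucleotide downstream flank, `flanking_primers(template, 20, 28)` returns primers whose delimited region is the whole 48-nucleotide template.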

[0188] Oligonucleotide primers of the invention are employed in the amplification method which is an enzymatic chain reaction that produces exponential quantities of the polymorphic locus relative to the number of reaction steps involved. Typically, one primer is complementary to the negative (−) strand of the polymorphic locus and the other is complementary to the positive (+) strand. Annealing the primers to denatured nucleic acid followed by extension with an enzyme, such as the large fragment of DNA polymerase I (Klenow) and nucleotides, results in newly synthesized + and − strands containing the target polymorphic locus sequence. Because these newly synthesized sequences are also templates, repeated cycles of denaturing, primer annealing, and extension result in exponential production of the region (i.e., the target polymorphic locus sequence) defined by the primers. The product of the chain reaction is a discrete nucleic acid duplex with termini corresponding to the ends of the specific primers employed.
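By way of non-limiting illustration only, the exponential accumulation described above can be sketched as an idealized model in which each cycle multiplies the number of target copies by (1 + E), where E is the per-cycle efficiency (E = 1 for perfect doubling). The model and the function names are assumptions made for illustration; real reactions plateau as reagents are consumed.

```python
def copies_after(cycles, starting_copies=1, efficiency=1.0):
    """Idealized number of target copies after a given number of cycles."""
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    return starting_copies * (1.0 + efficiency) ** cycles


def cycles_needed(fold, efficiency=1.0):
    """Smallest number of cycles giving at least `fold` amplification."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    cycles, amplification = 0, 1.0
    while amplification < fold:
        amplification *= 1.0 + efficiency
        cycles += 1
    return cycles
```

Under perfect doubling, 20 cycles give slightly more than a million-fold amplification (2²⁰ = 1,048,576); at lower efficiency more cycles are needed for the same yield.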

[0189] The oligonucleotide primers of the invention can be prepared using any suitable method, such as conventional phosphotriester and phosphodiester methods or automated embodiments thereof. In one such automated embodiment, diethylphosphoramidites are used as starting materials and can be synthesized as described by Beaucage et al., Tetrahedron Letters 22:1859-1862 (1981). One method for synthesizing oligonucleotides on a modified solid support is described in U.S. Pat. No. 4,458,066.

[0190] Any nucleic acid specimen, in purified or non-purified form, can be utilized as the starting nucleic acid or acids, providing it contains, or is suspected of containing, a nucleic acid sequence containing the polymorphic locus. Thus, the method can amplify, for example, DNA or RNA, including messenger RNA, wherein DNA or RNA can be single stranded or double stranded. In the event that RNA is to be used as a template, enzymes, and/or conditions optimal for reverse transcribing the template to DNA would be utilized. In addition, a DNA-RNA hybrid which contains one strand of each can be utilized. A mixture of nucleic acids can also be employed, or the nucleic acids produced in a previous amplification reaction herein, using the same or different primers can be so utilized. The specific nucleic acid sequence to be amplified, i.e., the polymorphic locus, can be a fraction of a larger molecule or can be present initially as a discrete molecule, so that the specific sequence constitutes the entire nucleic acid. It is not necessary that the sequence to be amplified be present initially in a pure form; it can be a minor fraction of a complex mixture, such as contained in whole human DNA.

[0191] DNA utilized herein can be extracted from a body sample, such as blood, tissue material (e.g. bone marrow tissue), and the like by a variety of techniques such as that described by Maniatis et al. in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., p 280-281 (1982). If the extracted sample is impure, it can be treated before amplification with an amount of a reagent effective to open the cells, or animal cell membranes of the sample, and to expose and/or separate the strand(s) of the nucleic acid(s). This lysing and nucleic acid denaturing step to expose and separate the strands will allow amplification to occur much more readily.

[0192] The deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP are added to the synthesis mixture, either separately or together with the primers, in adequate amounts and the resulting solution is heated to about 90-100° C. for about 1 to 10 minutes, preferably for 1 to 4 minutes. After this heating period, the solution is allowed to cool, which is preferable for the primer hybridization. To the cooled mixture is added an appropriate agent for effecting the primer extension reaction (called herein “agent for polymerization”), and the reaction is allowed to occur under conditions known in the art. The agent for polymerization can also be added together with the other reagents if it is heat stable. This synthesis (or amplification) reaction can occur at room temperature up to a temperature above which the agent for polymerization no longer functions. Thus, for example, if DNA polymerase is used as the agent, the temperature is generally no greater than about 40° C. Most conveniently the reaction occurs at room temperature.

[0193] The agent for polymerization can be any compound or system which will function to accomplish the synthesis of primer extension products, including enzymes. Suitable enzymes for this purpose include, for example, E. coli DNA polymerase I, Klenow fragment of E. coli DNA polymerase, polymerase muteins, reverse transcriptase, and other enzymes, including heat-stable enzymes (i.e., those enzymes which perform primer extension after being subjected to temperatures sufficiently elevated to cause denaturation), such as Taq polymerase. Suitable enzymes will facilitate combination of the nucleotides in the proper manner to form the primer extension products which are complementary to each polymorphic locus nucleic acid strand. Generally, the synthesis will be initiated at the 3′ end of each primer and proceed in the 5′ direction along the template strand, until synthesis terminates, producing molecules of different lengths.

[0194] The newly synthesized strand and its complementary nucleic acid strand will form a double-stranded molecule under hybridizing conditions described herein and this hybrid is used in subsequent steps of the method. In the next step, the newly synthesized double-stranded molecule is subjected to denaturing conditions using any of the procedures described above to provide single-stranded molecules.

[0195] The steps of denaturing, annealing, and extension product synthesis can be repeated as often as needed to amplify the target polymorphic locus nucleic acid sequence to the extent necessary for detection. The amount of the specific nucleic acid sequence produced will accumulate in an exponential fashion. See PCR: A Practical Approach, IRL Press, Eds. McPherson et al. (1992).

[0196] The amplification products can be detected by Southern blot analysis with or without using radioactive probes. In one such method, for example, a small sample of DNA containing a very low level of the nucleic acid sequence of the polymorphic locus is amplified, and analyzed via a Southern blotting technique or similarly, using dot blot analysis. The use of non-radioactive probes or labels is facilitated by the high level of the amplified signal. Alternatively, probes used to detect the amplified products can be directly or indirectly detectably labeled, for example, with a radioisotope, a fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a metal chelator or an enzyme. Those of ordinary skill in the art will know of other suitable labels for binding to the probe, or will be able to ascertain such, using routine experimentation.

[0197] Sequences amplified by the methods of the invention can be further evaluated, detected, cloned, sequenced, and the like, either in solution or after binding to a solid support, by any method usually applied to the detection of a specific DNA sequence such as dideoxy sequencing, PCR, oligomer restriction (Saiki et al., Bio/Technology 3:1008-1012 (1985)), allele-specific oligonucleotide (ASO) probe analysis (Conner et al., Proc. Natl. Acad. Sci. U.S.A. 80:278 (1983)), oligonucleotide ligation assays (OLAs) (Landgren et al., Science 241:1007 (1988)), and the like. Molecular techniques for DNA analysis have been reviewed (Landgren et al., Science 242:229-237 (1988)).

[0198] Preferably, the method of amplifying is by PCR, as described herein and in U.S. Pat. Nos. 4,683,195; 4,683,202; and 4,965,188 each of which is hereby incorporated by reference; and as is commonly used by those of ordinary skill in the art. Alternative methods of amplification have been described and can also be employed as long as a VDCC locus amplified by PCR using primers of the invention is similarly amplified by the alternative means. Such alternative amplification systems include but are not limited to self-sustained sequence replication, which begins with a short sequence of RNA of interest and a T7 promoter. Reverse transcriptase copies the RNA into cDNA and degrades the RNA, followed by reverse transcriptase polymerizing a second strand of DNA.

[0199] Another nucleic acid amplification technique is nucleic acid sequence-based amplification (NASBA™) which uses reverse transcription and T7 RNA polymerase and incorporates two primers to target its cycling scheme. NASBA™ amplification can begin with either DNA or RNA and finish with either, and amplifies to about 10⁸ copies within 60 to 90 minutes.

[0200] Alternatively, nucleic acid can be amplified by ligation activated transcription (LAT). LAT works from a single-stranded template with a single primer that is partially single-stranded and partially double-stranded. Amplification is initiated by ligating a cDNA to the promoter oligonucleotide and within a few hours, amplification is about 10⁸ to about 10⁹ fold. The Qβ replicase system can be utilized by attaching an RNA sequence called MDV-1 to RNA complementary to a DNA sequence of interest. Upon mixing with a sample, the hybrid RNA finds its complement among the specimen's mRNAs and binds, activating the replicase to copy the tag-along sequence of interest.

[0201] Another nucleic acid amplification technique, ligase chain reaction (LCR), works by using two differently labeled halves of a sequence of interest which are covalently bonded by ligase in the presence of the contiguous sequence in a sample, forming a new target. The repair chain reaction (RCR) nucleic acid amplification technique uses two complementary and target-specific oligonucleotide probe pairs, thermostable polymerase and ligase, and DNA nucleotides to geometrically amplify targeted sequences. A 2-base gap separates the oligo probe pairs, and the RCR fills and joins the gap, mimicking normal DNA repair.

[0202] Nucleic acid amplification by strand displacement activation (SDA) utilizes a short primer containing a recognition site for HincII with a short overhang on the 5′ end which binds to target DNA. A DNA polymerase fills in the part of the primer opposite the overhang with sulfur-containing adenine analogs. HincII is added but only cuts the unmodified DNA strand. A DNA polymerase that lacks 5′ exonuclease activity enters at the site of the nick and begins to polymerize, displacing the initial primer strand downstream and building a new one which serves as more primer.

[0203] SDA produces greater than about a 10⁷-fold amplification in 2 hours at 37° C. Unlike PCR and LCR, SDA does not require instrumented temperature cycling. Another amplification system useful in the method of the invention is the Qβ replicase system. Although PCR is the preferred method of amplification of the invention, these other methods can also be used to amplify a platelet VDCC α₁ subunit locus as described herein. Thus, the term “amplification technique” as used herein and in the claims is meant to encompass all the foregoing methods.

[0204] In another embodiment of the invention a method is provided for identifying a subject having a polymorphism of a platelet VDCC α₁ subunit gene, comprising sequencing a target nucleic acid of a sample from a subject by dideoxy sequencing, preferably following amplification of the target nucleic acid.

[0205] In another embodiment of the invention a method is provided for identifying a subject having a polymorphism of a platelet VDCC α₁ subunit gene, comprising contacting a target nucleic acid of a sample from a subject with a reagent that detects the presence of a platelet VDCC α₁ subunit polymorphism and detecting the reagent. A number of hybridization methods and conditions are well known to those skilled in the art and are disclosed herein. Many of them are useful in carrying out the invention.

[0206] Accordingly, a nucleotide sequence of the present invention can be used for its ability to selectively form duplex molecules with complementary stretches of a platelet VDCC α₁ subunit gene. Depending on the application envisioned, one employs varying conditions of hybridization to achieve varying degrees of selectivity of the probe toward the target sequence. For applications requiring a high degree of selectivity, one typically employs relatively stringent conditions to form the hybrids. For example, one selects relatively low salt and/or high temperature conditions, such as provided by 0.02M-0.15M salt at temperatures of about 50° C. to about 70° C. including particularly temperatures of about 55° C., about 60° C. and about 65° C. Such conditions are particularly selective, and tolerate little, if any, mismatch between the probe and the template or target strand.

[0207] Of course, for some applications, for example, where one desires to prepare mutants employing a mutant primer strand hybridized to an underlying template or where one seeks to isolate polypeptide coding sequences from related species, functional equivalents, or the like, less stringent hybridization conditions are typically needed to allow formation of the heteroduplex. Under such circumstances, one employs conditions such as 0.15M-0.9M salt, at temperatures ranging from about 20° C. to about 55° C., including particularly temperatures of about 25° C., about 37° C., about 45° C., and about 50° C. Cross-hybridizing species can thereby be readily identified as positively hybridizing signals with respect to control hybridizations. In any case, it is generally appreciated that conditions can be rendered more stringent by the addition of increasing amounts of formamide, which serves to destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization conditions can be readily manipulated, and thus will generally be a method of choice depending on the desired results.
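
The salt and temperature ranges given in the two preceding paragraphs can be summarized as a simple lookup. The following is a minimal sketch, assuming only the numeric ranges quoted in the text; the function name and return labels are hypothetical. Because the quoted ranges overlap at 0.15M salt and at about 50° C. to about 55° C., the sketch arbitrarily tests the more stringent range first. The effect of formamide is not modeled.

```python
def classify_stringency(salt_molar: float, temp_c: float) -> str:
    """Classify hybridization conditions against the ranges quoted in the text."""
    # Relatively stringent conditions: 0.02M-0.15M salt at about 50-70 deg C
    if 0.02 <= salt_molar <= 0.15 and 50.0 <= temp_c <= 70.0:
        return "high"
    # Less stringent conditions: 0.15M-0.9M salt at about 20-55 deg C
    if 0.15 <= salt_molar <= 0.9 and 20.0 <= temp_c <= 55.0:
        return "low"
    # Anything else falls outside both quoted ranges
    return "outside quoted ranges"


print(classify_stringency(0.05, 65.0))  # high
print(classify_stringency(0.5, 37.0))   # low
```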

[0208] In certain embodiments, it is advantageous to employ a nucleic acid sequence of the present invention in combination with an appropriate means, such as a label, for determining hybridization. A wide variety of appropriate indicator reagents are known in the art, including radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of giving a detectable signal. In preferred embodiments, one likely employs an enzyme tag such as urease, alkaline phosphatase or peroxidase, instead of radioactive or other environmentally undesirable reagents. In the case of enzyme tags, colorimetric indicator substrates are known which can be employed to provide a reagent visible to the human eye or spectrophotometrically, to identify specific hybridization with complementary nucleic acid-containing samples.

[0209] In general, it is envisioned that the hybridization probes described herein are useful both as reagents in solution hybridization as well as in embodiments employing a solid phase. In embodiments involving a solid phase, the sample containing test DNA (or RNA) is adsorbed or otherwise affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then subjected to specific hybridization with selected probes under desired conditions. The selected conditions depend inter alia on the particular circumstances based on the particular criteria required (depending, for example, on the G+C contents, type of target nucleic acid, source of nucleic acid, size of hybridization probe, etc.). Following washing of the hybridized surface so as to remove nonspecifically bound probe molecules, specific hybridization is detected, or even quantified, by means of the label.

[0210] The materials for use in the method of the invention are ideally suited for the preparation of a screening kit. Such a kit can comprise a carrier having compartments to receive in close confinement one or more containers such as vials, tubes, and the like, each of the containers comprising one of the separate elements to be used in the method. For example, one of the containers can comprise an amplifying reagent for amplifying a platelet VDCC α₁ subunit DNA, such as the necessary enzyme(s) and oligonucleotide primers for amplifying target DNA from the subject.

[0211] The oligonucleotide primers include primers having a sequence derived from the group including, but not limited to: SEQ ID NOs:1, 3, 5-8, 28, and 29, or primer sequences substantially complementary or substantially homologous thereto. Oligonucleotide primers comprising target flanking 5′ and 3′ polynucleotide sequence have substantially the sequence set forth in the flanking 5′ and 3′ portions of any of SEQ ID NOs:1, 3, 5-8, 28, and 29, and sequences substantially complementary or homologous thereto. Other oligonucleotide primers for amplifying a platelet VDCC α₁ subunit will be known or readily ascertainable to those of skill in the art given the disclosure of the present invention presented herein.

[0212] A kit in accordance with the present invention can further comprise solutions, buffers or other reagents for extracting a nucleic acid sample from a biological sample obtained from a subject. Any such reagents as would be readily apparent to one of ordinary skill in the art fall within the scope of the present invention. By way of particular example, a suitable lysis buffer for the tissue or cells along with a suspension of glass beads for capturing the nucleic acid sample and an elution buffer for eluting the nucleic acid sample off of the glass beads comprise a reagent for extracting a nucleic acid sample from a biological sample obtained from a subject.

[0213] Other examples include commercially available extraction kits, such as the GENOMIC ISOLATION KIT A.S.A.P.™ (Boehringer Mannheim, Indianapolis, Ind.), Genomic DNA Isolation System (GIBCO BRL, Gaithersburg, Md.), ELU-QUIK™ DNA Purification Kit (Schleicher & Schuell, Keene, N.H.), DNA Extraction Kit (Stratagene, La Jolla, Calif.), TURBOGEN™ Isolation Kit (Invitrogen, San Diego, Calif.), and the like. Use of these kits according to the manufacturer's instructions is generally acceptable for purification of DNA prior to practicing the methods of the present invention.

[0214] F.3. Polypeptide Screening

[0215] A method of screening for a disorder affecting calcium homeostasis in platelets is also provided. The method comprises: (a) obtaining a biological sample from a subject; (b) determining an amount of a platelet VDCC α₁ subunit polypeptide present in the biological sample; (c) determining the activity of a platelet VDCC α₁ subunit polypeptide present in the biological sample; and (d) detecting variations in calcium transport activity between a wild type platelet VDCC α₁ subunit polypeptide and an isolated platelet VDCC α₁ subunit polypeptide, any calcium transport activity variations between the wild type platelet VDCC α₁ subunit polypeptide and the isolated platelet VDCC α₁ subunit indicating the possibility of a disorder affecting calcium homeostasis in platelets. Detecting an amount of a platelet VDCC α₁ subunit polypeptide present in the biological sample can also be carried out in assessing a response to medication in a subject.

[0216] The nucleic acid sequences shown in SEQ ID NOs:1, 3, 5-8, 28, and 29 comprise platelet VDCC α₁ subunit polypeptide encoding sequences that are isolated from wild type cells. The sequence represents the platelet VDCC α₁ subunit nucleic acid sequence occurring in nature and existing without mutation. Therefore, wild type cells, as referred to herein, are those cells occurring in nature that contain non-mutated platelet VDCC α₁ subunit nucleic acid sequences. The wild type sequence is the native nucleic acid sequence and is the sequence against which assessments of polymorphism and mutation are made.

[0217] In another embodiment, the present invention provides an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide. Preferably, an antibody of the invention is a monoclonal antibody. More preferred antibodies distinguish between a wild type form and a mutant or polymorphic form of a platelet VDCC α₁ subunit polypeptide. Techniques for preparing such antibodies are disclosed herein. The antibodies can be used to screen for the presence of a mutant or polymorphic form of a platelet VDCC α₁ subunit polypeptide in a manner analogous to that set forth above with respect to polynucleotide screening.

[0218] G. Screening for Modulators of VDCC Biological Activity

[0219] In yet another aspect, the present invention provides a method of screening substances for their ability to affect or modulate the biological activity of platelet VDCC α₁ subunit gene products, and for their ability to affect or modulate in vivo platelet VDCC α₁ subunit levels. This modulation can affect platelet activation and other biological functions of platelets. Compounds identified via the screening methods of the present invention have application as anti-thrombotic agents or as agents for modulation of other biological events mediated by platelets.

[0220] Utilizing the methods and compositions of the present invention, screening assays for the testing of candidate substances are performed. A candidate substance is a substance which potentially can promote or inhibit the biological activity of a gene product by binding or other intermolecular interaction with a platelet VDCC α₁ subunit gene or gene product or control sequence.

[0221] G.1. Method of Screening for Modulators of Platelet VDCC α₁ Subunit Biological Activity

[0222] A representative method of screening candidate substances for their ability to modulate platelet VDCC α₁ subunit biological activity comprises: (a) establishing replicate test and control samples that comprise a biologically active platelet VDCC α₁ subunit polypeptide; (b) administering a candidate substance to test samples; (c) measuring the biological activity of the platelet VDCC α₁ subunit polypeptide in the test and the control samples; and (d) determining whether the candidate substance modulates platelet VDCC α₁ subunit biological activity relative to an appropriate control. By “modulate” it is intended an increase, decrease, preservation, maintenance or other effect of any or all biological activities or properties of a platelet VDCC. By way of additional example, a candidate substance identified according to the screening assay described herein has an ability to facilitate preservation of stored platelets. Thus, a candidate substance identified according to the screening assay described herein has an ability to modulate platelet VDCC α₁ subunit biological activity.
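
Steps (c) and (d) of the representative method amount to comparing replicate measurements from test and control samples. The following is a minimal sketch of that comparison, assuming that activity has already been reduced to one number per replicate in arbitrary units; the function name and the example values are hypothetical, and no particular statistical test or decision threshold is implied by the present disclosure.

```python
from statistics import mean


def percent_change(test_activity, control_activity):
    """Return the percent change of mean test activity relative to mean control.

    Negative values indicate a decrease (inhibition) and positive values an
    increase (potentiation) relative to the control samples.
    """
    control_mean = mean(control_activity)
    if control_mean == 0:
        raise ValueError("control activity is zero; relative change is undefined")
    return 100.0 * (mean(test_activity) - control_mean) / control_mean


# Hypothetical replicate measurements in arbitrary activity units
change = percent_change([50, 60, 40], [100, 100, 100])  # -50.0
```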

[0223] Such a candidate compound has utility in the treatment of disorders and conditions associated with the biological activity of a platelet VDCC α₁ subunit. Candidate compounds are typically about 500-1,000 daltons, and can be hydrophobic molecules, polycyclic molecules, or both. Such compounds should be considered as candidates for therapeutic intervention in accordance with the methods described herein below. Thus, compounds identified via the screening methods of the present invention have application as anti-thrombotic agents or as agents for modulation of other biological events mediated by platelets. Dosages of test agents can be determined by deriving dose-response curves, such as those disclosed in U.S. Pat. No. 5,849,578, herein incorporated by reference.

[0224] In a cell-free system, the method comprises establishing a control system comprising a platelet VDCC α₁ subunit polypeptide and a ligand to which the platelet VDCC α₁ subunit polypeptide is capable of binding, establishing a test system comprising the platelet VDCC α₁ subunit polypeptide, the ligand, and a candidate compound, and determining whether the candidate compound modulates platelet VDCC α₁ subunit activity in a cell-free system. A representative ligand comprises a monoclonal antibody, and in this embodiment, the biological activity or property screened includes binding affinity. Additionally, the platelet VDCC α₁ subunit polypeptide can be provided in a lipid bi-layer in accordance with techniques disclosed by Malouf, N. N., et al., Proc. Natl. Acad. Sci. USA 84:5019-5023 (1987) to facilitate the simulation of in vivo conditions in a cell-free setting.

[0225] In another embodiment of the invention, a platelet VDCC α₁ subunit polypeptide (e.g., platelet VDCC α₁ subunit or platelet VDCC α₁D subunit) or catalytic or immunogenic fragment or oligopeptide thereof, can be used for screening libraries of compounds in any of a variety of drug screening techniques. The fragment employed in such screening can be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes, between the platelet VDCC α₁ subunit polypeptide and the agent being tested, can be measured.

[0226] Another technique for drug screening which can be used provides for high throughput screening of compounds having suitable binding affinity to the protein of interest as described in published PCT application WO 84/03564, herein incorporated by reference. In this method, as applied to a platelet VDCC α₁ subunit polypeptide, large numbers of different small test compounds are synthesized on a solid substrate, such as plastic pins or some other surface. The test compounds are reacted with a platelet VDCC α₁ subunit polypeptide, or fragments thereof, and washed. Bound platelet VDCC α₁ subunit polypeptide is then detected by methods well known in the art. Purified platelet VDCC α₁ subunit polypeptide can also be coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support.

[0227] As is well known in the art, a screening assay can provide a cell under conditions suitable for testing the modulation of VDCC biological activity and/or levels of calcium channels. These conditions include but are not limited to pH, temperature, tonicity, the presence of relevant metabolic factors (e.g., metal ions such as for example Ca⁺⁺, growth factor, interleukins, or colony stimulating factors), and relevant modifications to the polypeptide such as glycosylation or prenylation. A polypeptide of the present invention can be expressed and utilized in a prokaryotic or eukaryotic cell. The host cell can also be fractionated into sub-cellular fractions where the receptor can be found. For example, cells expressing the polypeptide can be fractionated into the nuclei, the endoplasmic reticulum, vesicles, or the membrane surfaces of the cell. U.S. Pat. Nos. 5,837,479; 5,645,999; 5,786,152; 5,739,278; and 5,352,660 also describe exemplary screening assays, and the entire contents of each are herein incorporated by reference.

[0228] In one embodiment, a screening assay is designed to be capable of discriminating candidate substances having selective ability to interact with or modulate one or more of the genes or gene products of the present invention but which substances are without a substantially overlapping activity with another gene or gene product. For example, a substance can modulate the biological activity of a platelet VDCC α₁S subunit but have no effect, or a diminished effect, on a platelet VDCC α₁D subunit. Such selective effect can comprise a 30% greater effect on one test sample versus another, or more preferably 100% or greater effect.
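
The selectivity criterion stated above can be illustrated numerically. The following is a minimal sketch, assuming that “30% greater effect” and “100% or greater effect” are read as the relative excess of the effect on one test sample over the effect on the other; that reading, the function name, and the return labels are assumptions made for illustration only.

```python
def selectivity_class(effect_on_a: float, effect_on_b: float) -> str:
    """Classify how much greater the effect on sample A is than on sample B."""
    a, b = abs(effect_on_a), abs(effect_on_b)
    if a == 0:
        return "not selective"   # no effect on A at all
    if b == 0:
        return "preferred"       # effect on A with no effect on B
    excess_percent = 100.0 * (a - b) / b
    if excess_percent >= 100.0:
        return "preferred"       # 100% or greater effect
    if excess_percent >= 30.0:
        return "selective"       # at least 30% greater effect
    return "not selective"


print(selectivity_class(60.0, 40.0))  # selective (50% greater)
print(selectivity_class(80.0, 40.0))  # preferred (100% greater)
```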

[0229] A method of identifying modulators of a platelet calcium channel polypeptide by rational drug design is provided in accordance with the present invention. The method comprises the steps of designing a potential modulator for a platelet calcium channel polypeptide that will form non-covalent bonds with amino acids in the substrate binding site based upon the structure of a platelet VDCC α₁ subunit polypeptide; synthesizing the modulator; and determining whether the potential modulator modulates the activity of a calcium channel. Modulators can be synthesized using techniques known in the art. The determination of whether the modulator modulates the biological activity of a calcium channel is made in accordance with the screening methods disclosed herein, or by other screening methods known in the art.

[0230] A screening assay of the present invention can also involve determining the ability of a candidate substance to modulate, i.e., preserve, inhibit or promote, platelet VDCC α₁ subunit biological activity and preferably, to thereby modulate the biological activity of calcium channels in target cells. Target cells can be either naturally occurring cells known to contain a polypeptide of the present invention or transformed cells produced in accordance with a method of transformation set forth herein above. The test samples can further comprise a cell or cell line that expresses a platelet VDCC α₁ subunit polypeptide; the present invention also provides a recombinant cell line suitable for use in the exemplary method. Such cell lines can be mammalian, or human, or they can be from another organism, including but not limited to yeast. Exemplary assays include genetic screening assays and molecular biology screens, such as a yeast two-hybrid screen, that will effectively identify platelet VDCC α₁ subunit-interacting genes important for calcium transport or other platelet VDCC α₁ subunit-mediated cellular process. One version of the yeast two-hybrid system has been described (Chien et al., 1991, Proc. Natl. Acad. Sci. USA, 88:9578-9582) and is commercially available from Clontech (Palo Alto, Calif.).

[0231] G.2. Assays for Biological Activity

[0232] Biological activity of a platelet VDCC α₁ subunit polypeptide can be determined, for example, by an assay disclosed immediately below. Indeed, the biological activity of a platelet VDCC α₁ subunit polypeptide of the present invention can also be monitored as an indicator of drug therapy, such as anti-thrombotic therapy or other therapy directed at platelet activity.

[0233] G.2.a. Measurement of Cytoplasmic Calcium Signal

[0234] The conduction of calcium by the VDCC is measured in vitro by measuring changes in cytoplasmic free (ionized) calcium concentrations. Cytoplasmic calcium concentrations are measured by performing flow cytometric analysis with the calcium indicators fluo-3 and fura red as detailed by Novak and Rabinovitch (1994) Cytometry 17:135-141. Briefly, platelets in whole blood, platelet rich plasma or biological buffers are incubated with esters of fluo-3 and fura red, and then aspirated into the flow cytometric instrument for measurement of calcium-dependent fluorescent changes. Cytoplasmic calcium concentrations are calculated from the ratio of fluo-3 and fura red relative fluorescence intensities. Experiments are performed with increasing doses of VDCC inhibitors with platelets that are in the resting state and subjected to different concentrations of activation agonists and VDCC antagonists and agonists.
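
The conversion from a fluorescence ratio to a calcium concentration can be sketched as follows. This is a minimal illustration, assuming the widely used general calibration form for ratiometric indicators, [Ca²⁺] = K_d · β · (R − R_min)/(R_max − R); whether the cited reference uses exactly this form is not stated in the present disclosure. The function name, the parameter names, and all numeric values shown are hypothetical placeholders rather than measured calibration constants.

```python
def calcium_from_ratio(r: float, r_min: float, r_max: float,
                       kd_nm: float, beta: float = 1.0) -> float:
    """Estimate free calcium (nM) from a fluo-3/fura red fluorescence ratio.

    r      -- measured ratio of fluo-3 to fura red fluorescence
    r_min  -- ratio at zero calcium (from calibration)
    r_max  -- ratio at saturating calcium (from calibration)
    kd_nm  -- effective dissociation constant of the indicator system, in nM
    beta   -- scaling factor from calibration (assumed 1.0 if not determined)
    """
    if not (r_min < r < r_max):
        raise ValueError("ratio must lie strictly between r_min and r_max")
    return kd_nm * beta * (r - r_min) / (r_max - r)


# Placeholder calibration values, for illustration only
ca_nm = calcium_from_ratio(r=2.0, r_min=1.0, r_max=3.0, kd_nm=400.0)  # 400.0
```

In practice the calibration constants would be determined for the instrument and cell preparation in use.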

[0235] G.2.b. Platelet Activation Reactions

[0236] The following methods are used to analyze the effect of VDCC channel inhibition, potentiation and/or structural manipulation on platelet function.

[0237] 1. Activation time courses. Platelets are incubated at 37° C. with [³H]serotonin and ³²PO₄ for one hour to respectively load dense granules with a secretion marker and label the cytoplasmic ATP pool for protein phosphorylation studies. During this time period, some samples are incubated with increasing concentrations of PGI₂ to elevate intracellular cAMP. The cells are exposed with agitation to different concentrations of thrombin, ADP, epinephrine, collagen, phorbol ester and A23187. Samples are withdrawn, quenched, and analyzed to characterize the time course of aggregation, secretion, morphological changes and clot retraction.

[0238] 2. Aggregation. Aggregation is followed with flow cytometry by analyzing forward and side scatter patterns. Samples are examined with light microscopy and scored morphometrically and subjected to optical aggregometry.

[0239] 3. Secretion. Platelets are diluted and centrifuged during the activation time course to obtain a supernatant with granule contents. Dense, alpha and lysosomal granule secretion are analyzed respectively by measuring [³H]serotonin, thrombospondin and beta-N-acetylglucosaminidase levels in the supernatants. P-selectin expression on the surface membrane is quantified with flow cytometry after incubation of the cells with FITC-labeled monoclonal antibody to P-selectin (e.g., anti-P-selectin, Sigma Chemical Corp, St. Louis, Mo.).

[0240] 4. Ultrastructural analysis. Platelet samples are subjected to scanning and transmission electron-microscopy to study shape change, pseudopodia extension, granule centralization, microfilament organization and microtubule structure.

[0241] 5. Clot retraction. Microscale clot retraction analysis is performed in platelet-free plasma in siliconized glass capillary tubes with 10 μL samples. Time courses are initiated by adding thrombin to 1 unit/mL and calcium to 10 mM. The time course of retraction is followed by photographing the capillary sample every 10 seconds, and then the size of the clots is measured morphometrically.

[0242] 6. Intracellular stimulus response coupling. Protein kinase activities are followed by performing 2D isoelectric focusing, SDS-PAG electrophoresis and autoradiography as we have detailed elsewhere (Fischer et al., 2000; White et al., 1990). The identity of tyrosine, threonine and serine protein kinase substrates is confirmed with Western analysis.

[0243] 7. Analysis of cytoplasmic nucleotide pool. Platelets are incubated with ³²Pi to incorporate a radiolabel into cytoplasmic nucleotides. Cytoplasmic levels of ATP and ADP are measured with polyethyleneimine thin layer chromatography (TLC) and autoradiography as we have detailed elsewhere (Fischer et al., 2000). Ethanol solubility is used to determine the ratio of free vs. protein bound nucleotides (Holmsen, 1972). The intracellular nucleotide pool is examined before cross-linking, after cross-linking, and after lyophilization and rehydration. TLC analysis and sample preparation follow the procedure of Crabtree and Henderson (1971). Briefly, TLC plates are developed with 2 M sodium formate, pH=3.4, and then subjected to autoradiography. ATP, ADP and monophosphate spots are scraped and subjected to liquid scintillation counting to measure cytoplasmic levels of the nucleotides.

[0244] 8. GPIIb-IIIa function: fibrinogen binding and trafficking. The density of unligated GPIIb-IIIa on the platelet surface is determined by measuring RGD-inhibitable ¹²⁵I-fibrinogen binding as detailed by Sanders et al., (1996). Also, the amount of fibrinogen that is present on the surface of platelets from alpha-granule secretion is measured by quantifying FITC conjugated anti-fibrinogen monoclonal antibody binding with fluorescence. The ¹²⁵I-fibrinogen and anti-fibrinogen monoclonal antibody-FITC binding studies are conducted with standards so as to respectively yield the number of unoccupied and fibrinogen-ligated GPIIb-IIIa receptor per cell. The functionality of clathrin-dependent trafficking of surface bound fibrinogen to alpha-granules and clathrin-independent internalization to lysosomes (see, for example, Benke, 1992) is followed with confocal microscopy as described by Merricks et al. (1998).

[0245] 9. GPIb-IX function: von Willebrand binding. The function of the von Willebrand receptor complex is analyzed by performing binding studies with ¹²⁵I-labeled von Willebrand factor (vWf) as detailed elsewhere (Khandelwal et al., 1997). Briefly, platelets are incubated with the radiolabeled ligand, washed and then subjected to liquid scintillation counting. Binding studies are performed in the presence and absence of an inhibitory anti-GPIb monoclonal antibody (GPIb-mAb, Immunotech, Inc., Westbrook, Me.) and/or ristocetin.
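
As a hedged illustration of the counting step in item 7 above (analysis of the cytoplasmic nucleotide pool), the scintillation counts from the scraped ATP, ADP and monophosphate spots can be reduced to relative pool sizes. The following is a minimal sketch, assuming background-corrected counts per minute for each spot; the function names, dictionary keys, and example counts are hypothetical and do not represent measured data.

```python
def nucleotide_fractions(counts_cpm: dict) -> dict:
    """Return each spot's share of the total radiolabel recovered from the lane."""
    total = sum(counts_cpm.values())
    if total <= 0:
        raise ValueError("total counts must be positive")
    return {name: cpm / total for name, cpm in counts_cpm.items()}


def atp_adp_ratio(counts_cpm: dict) -> float:
    """Return the ratio of ATP counts to ADP counts."""
    if counts_cpm["ADP"] == 0:
        raise ValueError("ADP counts are zero; ratio is undefined")
    return counts_cpm["ATP"] / counts_cpm["ADP"]


# Hypothetical background-corrected counts per minute for one TLC lane
lane = {"ATP": 800.0, "ADP": 160.0, "monophosphate": 40.0}
fractions = nucleotide_fractions(lane)   # ATP share is 0.8
ratio = atp_adp_ratio(lane)              # 5.0
```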

[0246] G.3. Method of Screening for Modulators of In Vivo Platelet VDCC α₁ Subunit Levels

[0247] In accordance with the present invention there are also provided methods for screening candidate compounds for the ability to modulate in vivo platelet VDCC α₁ subunit levels and/or activity. Representative modulators of platelet VDCC α₁ subunit levels can comprise modulators of platelet VDCC α₁ subunit transcription or expression. Pharmaceuticals that increase or decrease the transcription or expression of platelet VDCC α₁ subunit encoding genes have important clinical application for the modulation of the biological activity of calcium channels. This modulation can affect calcium homeostasis in platelets.

[0248] This invention thus includes a method for discovery of compounds that modulate the expression levels of platelet VDCC α₁ subunit encoding genes, including not only the platelet VDCC α₁ subunit genes of the present invention but also other calcium channel polypeptide-encoding genes, and describes the use of such compounds. The general approach is to screen compound libraries for substances which increase or decrease expression of platelet VDCC α₁ subunit-encoding genes. Exemplary techniques are described in U.S. Pat. Nos. 5,846,720 and 5,580,722, the entire contents of each of which are herein incorporated by reference.

[0249] While the following terms are believed to be well understood by one of skill in the art, the following definitions are set forth to facilitate explanation of the invention.

[0250] “Transcription” means a cellular process involving the interaction of an RNA polymerase with a gene that directs the expression as RNA of the structural information present in the coding sequences of the gene. The process includes, but is not limited to, the following steps: (a) transcription initiation, (b) transcript elongation, (c) transcript splicing, (d) transcript capping, (e) transcript termination, (f) transcript polyadenylation, (g) nuclear export of the transcript, (h) transcript editing, and (i) stabilizing the transcript. “Expression” generally refers to the cellular processes by which a biologically active polypeptide is produced from RNA.

[0251] “Transcription factor” means a cytoplasmic or nuclear protein which binds to such gene, or binds to an RNA transcript of such gene, or binds to another protein which binds to such gene or such RNA transcript or another protein which in turn binds to such gene or such RNA transcript, so as to thereby modulate expression of the gene. Such modulation can additionally be achieved by other mechanisms; the essence of “transcription factor for a gene” is that the level of transcription of the gene is altered in some way.

[0252] In accordance with the present invention there is provided a method of identifying a candidate compound or molecule that is capable of modulating the transcription level of a gene encoding a platelet VDCC α₁ subunit polypeptide and thus is capable of acting as a therapeutic agent in the modulation of platelet VDCC α₁ subunit polypeptide effects. This modulation can affect calcium homeostasis in platelets, platelet activation and other biological functions of platelets and can also affect platelet storage or production of platelet products. Such modulation can be direct, i.e., through binding of a candidate molecule directly to the nucleotide sequence, whether DNA or RNA transcript, or such modulation can be achieved via one or more intermediaries, such as proteins other than a platelet VDCC α₁ subunit polypeptide which are affected by the candidate compound and ultimately modulate platelet VDCC α₁ subunit polypeptide transcription by any mechanism, including direct binding, phosphorylation or dephosphorylation, etc.

[0253] This method comprises contacting a cell or nucleic acid sample with a candidate compound or molecule to be tested. These samples contain nucleic acids which can contain elements that modulate transcription and/or translation of a platelet VDCC α₁ subunit gene, such as a platelet VDCC α₁ subunit promoter or putative upstream regulatory region, or other VDCC α₁ subunit promoter or putative upstream regulatory region, and a DNA sequence encoding a polypeptide which can be detected in some way. Thus, the polypeptide can be described as a “reporter” or “marker.” Preferably, the candidate compound directly and specifically transcriptionally modulates expression of the platelet VDCC α₁ subunit polypeptide-encoding gene. Such compounds have therapeutic or pharmaceutical uses in treating platelet VDCC α₁ subunit polypeptide-related diseases and/or disorders, in platelet-based medicine and laboratory efforts, and in preserving and transporting platelets.

[0254] The DNA sequence is coupled to and under the control of the promoter, under conditions such that the candidate compound or molecule, if capable of acting as a transcriptional modulator of a gene encoding platelet VDCC α₁ subunit polypeptide, causes the polypeptide to be expressed and so produces a detectable signal, which can be assayed quantitatively and compared to an appropriate control. Candidate compounds or molecules of interest can include those which increase or decrease, i.e., modulate, transcription from a platelet VDCC α₁ subunit promoter. The reporter gene can encode a reporter known in the art, such as luciferase, or it can encode a platelet VDCC α₁ subunit.

[0255] In certain embodiments of the invention the polypeptide so produced is capable of complexing with an antibody or is capable of complexing with biotin. In this case the resulting complexes can be detected by methods known in the art. The detectable signal of this assay can also be provided by messenger RNA produced by transcription of said reporter gene. Exactly how the signal is produced and detected can vary and is not the subject of the present invention; rather, the present invention provides the nucleotide sequences and/or putative regulatory regions of a platelet VDCC α₁ subunit for use in such an assay. The molecule to be tested in these methods can be a purified molecule, a homogenous sample, or a mixture of molecules or compounds. Further, in the method of the invention, the DNA in the cell can comprise more than one modulatable transcriptional regulatory sequence.

[0256] In accordance with the present invention there is also provided a rapid and high throughput screening method that relies on the methods described above. This screening method comprises separately contacting each of a plurality of substantially identical samples. In such a screening method the plurality of samples preferably comprises more than about 10⁴ samples, or more preferably comprises more than about 5×10⁴ samples.

[0257] G.4. Animal Models

[0258] In addition, animal-based systems can be used to identify compounds capable of modulating platelet VDCC α₁ subunit biological activity. Such animal models can be used for the identification of drugs, pharmaceuticals, therapies, and interventions that can be effective in modulating platelet VDCC α₁ subunit polypeptide biological activity. For example, animal models can be exposed to a compound that is suspected of exhibiting an ability to modulate platelet VDCC α₁ subunit polypeptide biological activity at a sufficient concentration and for a time sufficient to elicit such modulation of platelet VDCC α₁ subunit polypeptide biological activity in the exposed animals. The response of the animals to the exposure can be monitored by assessing in vivo platelet VDCC α₁ subunit polypeptide expression levels and activity, or by testing biological samples from the animal. As in the methods described above, the mechanism by which a compound modulates a platelet VDCC α₁ subunit polypeptide activity or achieves therapeutic effects can vary; the utility of the present invention does not depend on the precise mechanism by which an effect is achieved.

[0259] For example, an animal model of the present invention can comprise a pig with targeted modification of a pig platelet VDCC α₁ subunit polypeptide gene, as described herein above.

[0260] H. Modulation of Platelet VDCC α₁ Subunit Biological Activity in a Laboratory or Clinical Setting

[0261] An aspect of the invention encompasses any treatments that alter any aspect of platelet VDCC α₁ subunit polypeptide biological activity. Such methods of modulating the biological activity of a platelet calcium channel polypeptide are applicable in the laboratory and/or clinical setting to enhance the capability to store, freeze dry, dehydrate or otherwise manipulate platelets or platelet products, as well as being applicable in therapeutic intervention in a subject. Therapeutic intervention in a subject can encompass, for example, anti-thrombotic therapy or therapy via modulation of other biological events mediated by platelets. Representative anti-thrombotic therapy comprises treatment or prevention of arterial thromboses, e.g. the blood clots that cause heart attacks and strokes. Another representative therapeutic application comprises increasing platelet function in bleeding disorders (e.g. hemophilia), whether acquired or inherited.

[0262] As used herein, the terms “activity” and “biological activity” are meant to be synonymous and are meant to refer to any biological activity of a platelet VDCC α₁ subunit polypeptide (e.g., platelet VDCC α₁S subunit or platelet VDCC α₁D subunit). Representative biological activities of a platelet VDCC α₁ subunit include calcium transport or other biological activity in accordance with the present invention.

[0263] With respect to the therapeutic methods of the present invention, a preferred subject is a vertebrate subject. A preferred vertebrate is warm-blooded; a preferred warm-blooded vertebrate is a mammal. A preferred mammal is a pig or, most preferably, a human. As used herein and in the claims, the term “patient” is contemplated to include both human and animal patients. Thus, veterinary diagnostic and therapeutic uses are provided in accordance with the present invention and comprise a preferred embodiment of the present invention.

[0264] Contemplated is the treatment of mammals such as humans, as well as those mammals of importance due to being endangered, such as Siberian tigers; of economical importance, such as animals raised on farms for consumption by humans; and/or animals of social importance to humans, such as animals kept as pets or in zoos. Examples of such animals include but are not limited to: carnivores such as cats and dogs; swine, including pigs, hogs, and wild boars; ruminants and/or ungulates such as cattle, oxen, sheep, giraffes, deer, goats, bison, and camels; and horses. Also contemplated is the treatment of birds, including the treatment of those kinds of birds that are endangered and/or kept in zoos, as well as fowl, and more particularly domesticated fowl, i.e., poultry, such as turkeys, chickens, ducks, geese, guinea fowl, and the like, as they are also of economical importance to humans. Thus, contemplated is the treatment of livestock, including, but not limited to, domesticated swine, ruminants, ungulates, horses, poultry, and the like.

[0265] H.1. Modulation of Platelet VDCC α₁ Subunit Polypeptide Biological Activity

[0266] In one embodiment, the present inventive method comprises administering to a cell a substance that modulates, i.e., inhibits or promotes, a biological activity of a platelet VDCC α₁ subunit polypeptide. Such a substance can be identified according to any of the screening assays set forth above, either in vitro or in vivo. Representative cells include platelets and megakaryocytes. The cell can be in an in vitro setting or can be in a subject to be treated, such as a warm-blooded vertebrate as described herein above.

[0267] The method comprises treating a vertebrate subject suffering from a disorder associated with or mediated by platelet VDCC α₁ subunit polypeptide biological activity by administering to the subject an effective amount of a substance identified according to a screening assay described above. By the term “modulating”, it is meant that the substance can either promote or inhibit the biological activity of a platelet VDCC α₁ subunit, depending on the disorder to be treated, and can affect one or several of the platelet VDCC α₁ subunit polypeptides, including the platelet VDCC α₁D subunit polypeptide or the platelet VDCC α₁S subunit polypeptide, as well as other ion transporters, or other unrelated genes or gene products.

[0268] Therapeutic treatment can comprise the administration of antibodies against a chosen region of a platelet VDCC α₁ subunit polypeptide, the administration of a protein that enhances activity, or the administration of a protein that inhibits the transcription of the platelet VDCC α₁ subunit polypeptide. Such administration can provide treatment of disorders which can be caused or exacerbated by platelet VDCC α₁ subunit polypeptide-mediated mechanisms.

[0269] Insofar as a modulator of platelet VDCC α₁ subunit polypeptide activity can take the form of a polypeptide or of an anti-platelet VDCC α₁ subunit polypeptide monoclonal antibody or fragment thereof, it is to be appreciated that the potency can vary, and therefore a “therapeutically effective” amount can vary. However, as shown by the present assay methods, one skilled in the art can readily assess the potency and efficacy of a candidate platelet VDCC α₁ subunit polypeptide biological activity modulator of this invention and adjust the therapeutic regimen accordingly. A modulator of platelet VDCC α₁ subunit biological activity can be evaluated by a variety of techniques, including through the use of a responsive reporter, which drives expression of a reporter gene; interaction of platelet VDCC α₁ subunit polypeptides with a monoclonal antibody as described herein; and other assays known in the art and described herein.

[0270] The monoclonal antibodies or polypeptides of the invention can be administered parenterally by injection or by gradual infusion over time. Although the tissue to be treated can typically be accessed in the body by systemic administration and therefore most often treated by intravenous administration of therapeutic compositions, other tissues and delivery means are contemplated where there is a likelihood that the tissue targeted contains the target molecule and are known to those of skill in the art. The compositions are formulated in an appropriate manner and administered in a manner compatible with the dosage formulation.

[0271] H.2. Monoclonal Antibodies

[0272] The present invention describes, in one embodiment, platelet VDCC α₁ subunit polypeptide modulators in the form of monoclonal antibodies which were elicited in response to platelet VDCC α₁ subunit but which can immunoreact with any platelet VDCC α₁ subunit polypeptide, or with a specific isoform of a platelet VDCC α₁ subunit polypeptide, and bind the platelet VDCC α₁ subunit polypeptide to modulate biological activity. The invention also describes cell lines that produce the antibodies, methods for producing the cell lines, and methods for producing the monoclonal antibodies.

[0273] The term “antibody” or “antibody molecule” refers collectively to a population of immunoglobulin molecules and/or immunologically active portions of immunoglobulin molecules, i.e., molecules that contain a paratope. A paratope is the portion or portions of an antibody that is or are responsible for that antibody binding to an antigenic determinant, or epitope.

[0274] Representative antibodies for use in the present invention are intact immunoglobulin molecules, substantially intact immunoglobulin molecules, single chain immunoglobulins or antibodies, those portions of an immunoglobulin molecule that contain the paratope, including antibody fragments. Indeed, it is within the scope of the present invention that a monovalent modulator can optionally be used. Thus, the terms “modulate”, “modulating”, and “modulator” are intended to encompass such a mechanism.

[0275] The term “monoclonal antibody” refers to a population of antibody molecules that contain only one species of paratope and thus typically display a single binding affinity for any particular epitope with which it immunoreacts; a monoclonal antibody can have a plurality of antibody combining sites, each immunospecific for a different epitope, e.g., a bispecific monoclonal antibody. Methods of producing a monoclonal antibody, a hybridoma cell, or a hybridoma cell culture are described above.

[0276] It is also possible to determine, without undue experimentation, if a monoclonal antibody has the same or equivalent specificity or immunoreaction characteristics as a monoclonal antibody of this invention by ascertaining whether the former prevents the latter from binding to a preselected target molecule. If the monoclonal antibody being tested competes with the monoclonal antibody of the invention, as shown by a decrease in binding by the monoclonal antibody of the invention in standard competition assays for binding to the target molecule when present in the solid phase, then it is likely that the two monoclonal antibodies bind to the same, or a closely related, epitope.

[0277] Still another way to determine whether a monoclonal antibody has the specificity of a monoclonal antibody of the invention is to pre-incubate the monoclonal antibody of the invention with the target molecule with which it is normally reactive, and then add the monoclonal antibody being tested to determine if the monoclonal antibody being tested is inhibited in its ability to bind the target molecule. If the monoclonal antibody being tested is inhibited then, in all likelihood, it has the same, or functionally equivalent, epitopic specificity as the monoclonal antibody of the invention.
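
The decrease in binding described in the two preceding paragraphs is often expressed as a percent inhibition. The following Python sketch is illustrative only: the function name, the optional background subtraction, and the example readings are assumptions and are not taken from the present disclosure.

```python
def percent_inhibition(signal_alone, signal_with_competitor, background=0.0):
    """Percent decrease in reference-antibody binding caused by a test antibody.

    signal_alone: binding signal of the reference antibody without competitor
    signal_with_competitor: same measurement with the test antibody present
    background: signal of a no-antibody control (hypothetical; defaults to 0)
    """
    max_binding = signal_alone - background
    if max_binding <= 0:
        raise ValueError("reference signal must exceed background")
    residual = signal_with_competitor - background
    return 100.0 * (1.0 - residual / max_binding)


# Hypothetical plate readings, not data from the disclosure.
print(percent_inhibition(1.20, 0.30, background=0.10))
```

A value near 0 would suggest that the two antibodies bind independently, while a value near 100 would be consistent with binding to the same, or a closely related, epitope. Any cutoff used to call two antibodies equivalent would have to be set by the practitioner.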

[0278] An additional way to determine whether a monoclonal antibody has the specificity of a monoclonal antibody of the invention is to determine the amino acid residue sequence of the CDR regions of the antibodies in question. “CDRs” (complementarity-determining regions) mean the three subregions of the light or heavy chain variable regions which have hypervariable sequences and form loop structures that are primarily responsible for making direct contact with antigen. Antibody molecules having identical, or functionally equivalent, amino acid residue sequences in their CDR regions have the same binding specificity. Methods for sequencing polypeptides are well known in the art. Further, other ways of determining whether antibodies have similar immunospecificities are known in the art and can be useful in practicing the present invention.

[0279] The immunospecificity of an antibody, its target molecule binding capacity, and the attendant affinity the antibody exhibits for the epitope are defined by the epitope with which the antibody immunoreacts. The epitope specificity is defined at least in part by the amino acid residue sequence of the variable region of the heavy chain of the immunoglobulin that comprises the antibody, and in part by the light-chain-variable-region amino acid residue sequence. Use of the terms “having the binding specificity of” or “having the binding preference of” indicates that equivalent monoclonal antibodies exhibit the same or similar immunoreaction (binding) characteristics and compete for binding to a preselected target molecule.

[0280] Humanized monoclonal antibodies offer particular advantages over monoclonal antibodies derived from other mammals, particularly insofar as they can be used therapeutically in humans. Specifically, human antibodies are not cleared from the circulation as rapidly as “foreign” antigens, and do not activate the immune system in the same manner as foreign antigens and foreign antibodies. Methods of preparing “humanized” antibodies are generally well known in the art, and can readily be applied to the antibodies of the present invention.

[0281] The use of a molecular cloning approach to generate antibodies, particularly monoclonal antibodies, and more particularly single chain monoclonal antibodies, is also provided. The production of single chain antibodies has been described in the art, see e.g., U.S. Pat. No. 5,260,203, the contents of which are herein incorporated by reference. For this approach, combinatorial immunoglobulin phagemid libraries are prepared from RNA isolated from the spleen of the immunized animal, and phagemids expressing appropriate antibodies are selected by panning on endothelial tissue. The advantages of this approach over conventional hybridoma techniques are that approximately 10⁴ times as many antibodies can be produced and screened in a single round, and that new specificities are generated by H and L chain combination in a single chain, which further increases the chance of finding appropriate antibodies. Thus, an antibody of the present invention, or a “derivative” of an antibody of the present invention, pertains to a single polypeptide chain binding molecule which has binding specificity and affinity substantially similar to the binding specificity and affinity of the light and heavy chain aggregate variable region of an antibody described herein.

[0282] H.3. Other Modulators

[0283] Given the disclosure of the platelet VDCC α₁ subunit polypeptide activity in tissues herein, chemical compounds (e.g. small molecule mimetics) can be used to modulate platelet VDCC α₁ subunit polypeptide activity in tissues in accordance with the methods of the present invention. The identification of such compounds is facilitated by the description of screening assays directed to platelet VDCC α₁ subunit polypeptide activity in tissues presented above. Such compounds are typically about 500-1,000 daltons, and can be hydrophobic molecules, polycyclic molecules, or both.

[0284] H.4. Gene Therapy

[0285] Platelet VDCC α₁ subunit polypeptide genes can be used for gene therapy in accordance with the present invention. Exemplary gene therapy methods, including liposomal transfection of nucleic acids into host cells, are described in U.S. Pat. Nos. 5,279,833; 5,286,634; 5,399,346; 5,646,008; 5,651,964; 5,641,484; and 5,643,567, the contents of each of which are herein incorporated by reference.

[0286] Briefly, gene therapy directed toward modulation of platelet VDCC α₁ subunit polypeptide levels, to thereby affect or modulate the biological activity of platelet VDCC α₁ subunit polypeptide in a target cell is described. This modulation can affect calcium transport, to thereby affect platelet activation or other biological effect. In one embodiment, a therapeutic method of the present invention provides a method for modulation of platelet VDCC α₁ subunit polypeptide levels comprising the steps of: (a) delivering to the cell an effective amount of a DNA molecule comprising a polynucleotide that encodes a polypeptide that modulates the biological activity of one or more than one platelet VDCC α₁ subunit polypeptide; and (b) maintaining the cell under conditions sufficient for expression of said polypeptide.

[0287] In a preferred embodiment, the delivered polypeptide comprises the sequence of SEQ ID NO:2 or 4. Delivery can be accomplished by injecting the DNA molecule into the cell. Where the cell is in a subject, administering comprises: (a) providing a vehicle that contains the DNA molecule; and (b) administering the vehicle to the subject.

[0288] A vehicle is preferably a cell transformed or transfected with the DNA molecule or a transfected cell derived from such a transformed or transfected cell. An exemplary and preferred transformed or transfected cell is a lymphocyte or a tumor cell from the tumor being treated. Means for transforming or transfecting a cell with a DNA molecule of the present invention are set forth above.

[0289] Alternatively, the vehicle is a virus or an antibody that specifically infects or immunoreacts with an antigen of the target tissue or tumor. An advantage of a viral infection system is that it allows for a very high level of infection into the appropriate recipient cell. Also, antibodies have been used to target and deliver DNA molecules.

[0290] It is also envisioned that this embodiment of the present invention can be practiced using alternative viral or phage vectors, including retroviral vectors, adenoviral vectors and vaccinia viruses whose genome has been manipulated in alternative ways so as to render the virus non-pathogenic. Methods for creating such a viral mutation are set forth in detail in U.S. Pat. No. 4,769,331, incorporated herein by reference.

[0291] In a preferred embodiment, the vector is a recombinant vector comprising: (a) a sequence of genomic viral DNA showing affinity for host cells; (b) a DNA sequence encoding a platelet VDCC α₁ subunit polypeptide and operatively linked to said sequence of genomic viral DNA; and (c) a selectable marker.

[0292] H.5. Method of Modulating In Vivo Platelet VDCC α₁ Subunit Polypeptide Levels in the Treatment of Related Diseases and Disorders

[0293] A method for transcriptionally modulating in a cell or in a multicellular organism the expression of a gene encoding a platelet VDCC α₁ subunit polypeptide to modulate platelet VDCC α₁ subunit polypeptide biological activity in the cell or organism is also provided in accordance with the present invention. This method comprises administering to the cell or to the organism a compound at a concentration effective to transcriptionally modulate expression of platelet VDCC α₁ subunit polypeptide or cotransporters. Representative cells include platelets and megakaryocytes. The cell can be in an in vitro setting or can be in an organism to be treated, such as a warm-blooded vertebrate as described herein above.

[0294] In accordance with the present invention, the provided compound can optionally comprise an antibody or polypeptide prepared as described above and which transcriptionally modulates expression of platelet VDCC α₁ subunit polypeptides. Optionally, the antibody or polypeptide directly binds to DNA or RNA, or directly binds to a protein involved in transcription.

[0295] Particular chemical entities (e.g. small molecule mimetics) for use in accordance with the present invention do not naturally occur in any cell, whether of a multicellular or a unicellular organism. Even more particularly, the chemical entity is not a naturally occurring molecule, e.g. it is a chemically synthesized entity. Optionally, the compound can bind a modulatable transcription sequence of the gene. For example, the compound can bind a promoter region upstream of a nucleic acid sequence encoding a platelet VDCC α₁ subunit polypeptide.

[0296] In the methods above, modulation of transcription results in either upregulation or downregulation of expression of the gene encoding the protein of interest, depending on the identity of the molecule which contacts the cell.

[0297] H.6. Antisense Oligonucleotide Therapy

[0298] It is also provided according to the present invention that expression of a platelet VDCC α₁ subunit polypeptide can be modulated in a vertebrate subject through the administration of an antisense oligonucleotide derived from a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide, such as those described in SEQ ID NO:2 and 4. Therapeutic methods utilizing antisense oligonucleotides have been described in the art, for example, in U.S. Pat. Nos. 5,627,158 and 5,734,033, the contents of each of which are herein incorporated by reference.

[0299] H.7. Dosages

[0300] As used herein, an “effective” dose refers to one that is administered in doses tailored to a particular application in which calcium transport modulation or other modulation of platelet VDCC α₁ subunit biological activity is sought. For example, after review of the disclosure herein of the present invention, one of ordinary skill in the art can tailor the dosages to an individual patient, taking into account the particular formulation and method of administration to be used with the composition as well as patient height, weight, severity of symptoms, and stage of the disorder to be treated.

[0301] An effective dose and a therapeutically effective dose are generally synonymous. However, compounds can be administered to patients having reduced symptoms or even administered to patients as a preventative measure. Hence, the composition can be effective in therapeutic treatment even in the absence of symptoms of the disorder.

[0302] A unit dose can be administered, for example, 1 to 4 times per day. Most preferably, the unit dose is administered twice a day (BID). The dose depends on the route of administration and the formulation of a composition containing the compound or compounds. Further, it will be appreciated by one of ordinary skill in the art after receiving the disclosure of the present invention that it can be necessary to make routine adjustments or variations to the dosage depending on the combination of agents employed, on the age and weight of the patient, and on the severity of the condition to be treated.

[0303] Such adjustments or variations, as well as evaluation of when and how to make such adjustments or variations, are well known to those of ordinary skill in the art of medicine. Evaluation parameters and techniques can vary with the patient and the severity of the disease. Particularly useful evaluative techniques are disclosed in the Examples.

[0304] H.7.1. Gene Therapy Vector Construct Dosing.

[0305] Maximally tolerated dose (MTD) of vector construct when administered directly into the affected tissue is determined. Primary endpoints are: 1) the rate of transduction in abnormal and/or normal cells, 2) the presence and stability of this vector in the systemic circulation and in affected cells, and 3) the nature of the systemic (fever, myalgias) and local (infections, pain) toxicities induced by the vector. A secondary endpoint is the clinical efficacy of the vector construct.

[0306] For example, a 4 mL serum-free volume of viral (e.g. adenoviral, retroviral, etc.) vector construct (containing up to 5×10⁷ viral particles in AIM V media) is administered daily per session. During each session, 1 mL of medium containing the appropriate titer of vector construct is injected into 4 regions of the affected tissue for a total of 4 mL per session in a clinical examination room. This is repeated daily for 4 days (4 sessions). This 16 mL total inoculum volume over 4 days is proportionally well below the one safely tolerated by nude mice (0.5 mL/20 g body weight).
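
The volume comparison in the preceding paragraph can be checked with a short calculation. The following Python sketch uses only the volumes stated above; the 70 kg patient body weight and all variable and function names are assumptions introduced for illustration.

```python
ML_PER_REGION = 1         # mL injected per region (from the text)
REGIONS_PER_SESSION = 4   # regions injected per session (from the text)
SESSIONS = 4              # one session per day for 4 days (from the text)

MOUSE_TOLERATED_ML = 0.5  # mL tolerated by a nude mouse (from the text)
MOUSE_WEIGHT_G = 20       # g body weight of that mouse (from the text)

ASSUMED_PATIENT_KG = 70   # hypothetical adult body weight, not from the text


def total_inoculum_ml():
    """Total volume delivered over all sessions."""
    return ML_PER_REGION * REGIONS_PER_SESSION * SESSIONS


def ml_per_kg(volume_ml, body_weight_kg):
    """Volume normalized to body weight."""
    return volume_ml / body_weight_kg


# Mouse reference expressed per kg: 0.5 mL per 20 g is 25 mL per kg.
mouse_ml_per_kg = MOUSE_TOLERATED_ML * 1000 / MOUSE_WEIGHT_G
patient_ml_per_kg = ml_per_kg(total_inoculum_ml(), ASSUMED_PATIENT_KG)

print(total_inoculum_ml(), "mL total")
print(mouse_ml_per_kg, "mL/kg tolerated in mice")
print(round(patient_ml_per_kg, 3), "mL/kg in the assumed patient")
```

Under the assumed body weight, the weight-normalized volume is roughly two orders of magnitude below the mouse reference, consistent with the statement that the inoculum is proportionally well below the tolerated volume.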

[0307] Patient evaluation includes history and physical examination prior to initiation of therapy and daily during the 4 day period of vector construct injection. Toxicity grading is done using the ECOG Common Toxicity Criteria. CBC, SMA-20, urinalysis, and conventional studies are performed daily during this period.

[0308] H.7.2. Dose escalation and MTD.

[0309] Patients are treated with 3×10⁶ viral particles×4. Once they have all recovered from all grade 2 or less toxicities (except alopecia), and as long as grade 3-4 toxicity is not encountered, a subsequent dose level is initiated in patients. If one grade 3 or 4 toxicity occurs at a given dose level, a minimum of 6 patients are enrolled at that level. If only 1 of 6 patients has grade 3 or 4 toxicity, dose escalation continues. The MTD of vector construct is defined as the dose where 2 of 6 patients experience grade 3 or 4 toxicity. If 2 of 3, or if 3 of 6 patients experience grade 3 or 4 toxicity, the MTD is defined as the immediately lower dose level.

[0310] The following escalation schema is followed: 1) level 1, 3×10⁶ viral particles; 2) level 2, 1×10⁷; 3) level 3, 3×10⁷; 4) level 4, 5×10⁷. Patients with measurable disease are evaluated for a clinical response to vector construct. Histology and local symptoms are followed.
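
The escalation rules of this section can be summarized as a small decision function. The following Python sketch is one possible reading of those rules, not a clinical protocol: the function and label names are hypothetical, the initial cohort size of 3 is inferred from the "2 of 3" wording, and the handling of cohorts of 4 or 5 patients (continue enrolling to 6) is an assumption, since the text does not address them.

```python
# Dose levels in viral particles, as listed in the escalation schema.
DOSE_LEVELS = [3e6, 1e7, 3e7, 5e7]


def next_action(n_treated, n_toxic):
    """Decide what to do at a dose level.

    n_treated: patients treated at this level so far
    n_toxic:   how many of them had grade 3 or 4 toxicity
    """
    if n_toxic == 0:
        return "escalate"
    # 2 of 3, or 3 (assumed: or more) toxicities: MTD is the lower level.
    if n_toxic >= 3 or (n_treated == 3 and n_toxic == 2):
        return "mtd_is_lower_level"
    # One toxicity (or, by assumption, two among 4-5 patients): enroll to 6.
    if n_treated < 6:
        return "expand_to_6"
    # Six patients treated with 1 or 2 toxicities.
    return "escalate" if n_toxic == 1 else "mtd_is_this_level"


for cohort in [(3, 0), (3, 1), (6, 1), (6, 2), (3, 2), (6, 3)]:
    print(cohort, "->", next_action(*cohort))
```

An "escalate" result moves to the next entry of DOSE_LEVELS; what happens after the highest listed level is reached is not specified in the text and is therefore not modeled here.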

[0311] H.8. Formulation of Therapeutic Compositions

[0312] The platelet VDCC α₁ subunit polypeptide biological activity modulating substances, gene therapy vectors, and substances that inhibit or promote expression of a platelet VDCC α₁ subunit polypeptide encoding nucleic acid segment described above are adapted for administration as pharmaceutical compositions as described above. Additional formulation and dose preparation techniques have been described in the art, see for example, those described in U.S. Pat. No. 5,326,902 issued to Seipp et al. on Jul. 5, 1994, U.S. Pat. No. 5,234,933 issued to Marnett et al. on Aug. 10, 1993, and PCT Publication WO 93/25521 of Johnson et al. published Dec. 23, 1993, the entire contents of each of which are herein incorporated by reference.

[0313] For the purposes described above, the identified substances can normally be administered systemically or partially, usually by oral or parenteral administration. The doses to be administered are determined depending upon age, body weight, symptom, the desired therapeutic effect, the route of administration, and the duration of the treatment, etc.; one of skill in the art of therapeutic treatment will recognize appropriate procedures and techniques for determining the appropriate dosage regimen for effective therapy. Various compositions and forms of administration are provided and are generally known in the art. Other compositions for administration include liquids for external use, endermic liniments (ointment, etc.), suppositories, and pessaries, which comprise one or more of the active substance(s) and can be prepared by known methods.

[0314] Thus, the present invention provides pharmaceutical compositions comprising a polypeptide, polynucleotide, or molecule or compound of the present invention and a physiologically acceptable carrier. More preferably, a pharmaceutical composition comprises a compound discovered via the screening methods described herein below.

[0315] A composition of the present invention is typically administered parenterally in dosage unit formulations containing standard, well-known nontoxic physiologically acceptable carriers, adjuvants, and vehicles as desired. The term “parenteral” as used herein includes intravenous, intra-muscular, intra-arterial injection, or infusion techniques.

[0316] Injectable preparations, for example sterile injectable aqueous or oleaginous suspensions, are formulated according to the known art using suitable dispersing or wetting agents and suspending agents. The sterile injectable preparation can also be a sterile injectable solution or suspension in a nontoxic parenterally acceptable diluent or solvent, for example, as a solution in 1,3-butanediol.

[0317] Among the acceptable vehicles and solvents that can be employed are water, Ringer's solution, and isotonic sodium chloride solution. In addition, sterile, fixed oils are conventionally employed as a solvent or suspending medium. For this purpose any bland fixed oil can be employed including synthetic mono- or di-glycerides. In addition, fatty acids such as oleic acid find use in the preparation of injectables.

[0318] Preferred carriers include neutral saline solutions buffered with phosphate, lactate, Tris, and the like. Of course, one purifies the vector sufficiently to render it essentially free of undesirable contaminants, such as defective interfering adenovirus particles or endotoxins and other pyrogens such that it does not cause any untoward reactions in the individual receiving the vector construct. A preferred means of purifying the vector involves the use of buoyant density gradients, such as cesium chloride gradient centrifugation or column chromatography.

[0319] A transfected cell can also serve as a carrier. By way of example, a liver cell can be removed from an organism, transfected with a polynucleotide of the present invention using methods set forth above and then the transfected cell returned to the organism (e.g. injected intra-vascularly).

EXAMPLES

[0320] The following Examples have been included to illustrate modes of the invention. Certain aspects of the following Examples are described in terms of techniques and procedures found or contemplated by the present inventors to work well in the practice of the invention. These Examples are exemplified through the use of standard laboratory practices of the inventors. In light of the present disclosure and the general level of skill in the art, those of skill will appreciate that the following Examples are intended to be exemplary only and that numerous changes, modifications and alterations can be employed without departing from the spirit and scope of the invention.

Example 1 Isolation of Fresh Platelets and Cultured Megakaryocytes

[0321] Due to the paucity of undegraded mRNA in stored human platelets, mRNA was isolated from platelet precursor cells, megakaryocytes, in culture. The findings disclosed herein were confirmed in freshly isolated porcine platelets. Freshly isolated human platelets were used for the ultrastructural studies.

[0322] Blood was collected in sodium citrate (9:1 v/v) from informed and consenting healthy volunteers and from pigs according to the National Institutes of Health (NIH) human and animal utilization guidelines and approved by University of North Carolina-Chapel Hill (UNC-CH) committees. Platelet-rich plasma (PRP) was obtained by centrifugation of the blood at 800 g for 5 min at room temperature (r.t.), and platelets were isolated by centrifugation at 2000 g for 15 min at r.t. The platelets were immediately used for mRNA or protein isolations.
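
The centrifugation steps above are stated as relative centrifugal force (g). To set a particular instrument, the force is commonly converted to rotor speed using the standard relation RCF = 1.118×10⁻⁵ × r × N², where r is the rotor radius in cm and N is the speed in rpm. The following Python sketch applies that relation; the 10 cm rotor radius and the function names are assumptions, since the centrifuge and rotor are not identified in this Example.

```python
import math

RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in rpm


def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) at a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (rpm) needed to reach a given relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


ASSUMED_RADIUS_CM = 10.0  # hypothetical rotor radius, not from the text

for step, rcf in [("PRP", 800), ("platelet pellet", 2000)]:
    rpm = rpm_from_rcf(rcf, ASSUMED_RADIUS_CM)
    print(f"{step}: {rcf} g is about {rpm:.0f} rpm at r = {ASSUMED_RADIUS_CM} cm")
```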

[0323] Meg 01 cells (a human megakaryocytic cell line available from the American Type Culture Collection (ATCC) in Manassas, Va.) were grown in RPMI (GIBCO BRL, Gaithersburg, Md.) supplemented with 10% fetal calf serum in 10% CO₂ at 37° C. Cells were harvested by pelleting. Human megakaryocytes were obtained from discarded bone marrow smears with permission from the UNC-CH Institutional Review Board (IRB).

Example 2 Preparation of mRNA for Northern Blots and RT-PCR

[0324] Platelets or megakaryocytes were lysed in a reagent sold under the registered trademark TRIPURE® by Boehringer Mannheim of Indianapolis, Ind. via repetitive pipetting. Total RNA was isolated according to the specifications supplied by the vendor. The total RNA was treated with 20 units of RNase-free DNase per 100 μg of total RNA. mRNA was enriched using an mRNA kit sold under the registered trademark OLIGOTEX® by QIAGEN GMBH of Hilden, Germany. Approximately 10 μg of mRNA was size fractionated on gels (Ambion Inc., Austin, Tex.) in formaldehyde denaturing conditions for Northern blots. See FIG. 4. mRNA was transferred onto a membrane by capillary action for Northern blots.

[0325] Antisense riboprobes of 1033 nucleotides (nt) were synthesized to correspond to the region between IV S₃ and amino acid 1531 (aa1531) for α₁S and between IV S₃ and amino acid 1663 (aa1663) for α₁D (Seino S., Chen L., et al., Proc Natl Acad Sci U S A. (1992) 89:584-588). These probes were obtained from the PCR reaction using oligonucleotides 1×2 (Table 3). The probes were transcribed as the antisense strand from the linearized PCR-TOPO® cloning vector (Invitrogen Corporation of Carlsbad, Calif.) that contained the PCR product as an insert.

[0326] First strand cDNA was synthesized from 0.2 μg mRNA with oligo-dT or random hexamers as primers. A reverse transcriptase enzyme sold under the trademark SUPERSCRIPT II™ by Life Technologies, Inc. of Rockville, Md. was used, and the reaction was carried out according to specifications provided by the vendor. Following the synthesis of the first strand, the mRNA template was removed with RNase treatment. The cDNA obtained was ethanol precipitated and then used for PCR amplification. PCR primers were designed based upon regions of VDCC α₁S (Hogan K., et al., Genomics (1994) 24:608-609) and α₁D (Seino S., et al., Proc Natl Acad Sci U S A (1992) 89:584-588) subunits, such that isoform specific intervening sequences containing regions previously reported to have functional importance would be amplified. Primers (Table 3) were used at a concentration of 100 picomoles per reaction.

[0327] PCR was carried out according to standard techniques. The PCR products were examined by agarose gel electrophoresis and cloned into a vector sold under the registered trademark PCR II-TOPO® by Invitrogen Corporation of Carlsbad, Calif. The clones were screened by PCR, and the desired cDNA was prepared using a DNA Miniprep kit (Qiagen, Inc. of Sorrento, Calif.) for sequencing by the UNC-CH Automated DNA Sequencing Facility.

TABLE 3

Oligonucleotide   5′         3′         Combination for       Ref.   Predicted polypeptide
No.               position   position   PCR amplification            region amplified
1                 3763       3785       1 × 2 (see FIG. 1)    α₁S    IV S₃ - aa1531
2                 4796       4760       1 × 2 (see FIG. 1)    α₁D    IV S₃ - aa1633
3                 4684       4705       3 × 4                 α₁S    aa1483 - end of the polypeptide
4                 5845       5824
5                 6661       6633       3 × 5                 α₁D    aa1586 - end of the polypeptide
6                 3240       3265       6 × 7                 α₁D    III P - IV P
7                 4220       4194
8                 2221       2243       8 × 9                 α₁S    Intracytoplasmic loop II - III
9                 2622       2601
10                2949       2972       10 × 11               α₁S    III S₄ - IV S₃

[0328] Oligonucleotide primers were used in PCR reactions to amplify regions of α₁S and α₁D from platelets and megakaryocytes. The 5′ and 3′ positions are numbered according to the cDNAs accessed in GenBank: L33798 (Hogan, K., et al. Genomics (1994) 24:608-609) and M83566 (Seino, S., et al., Proc Natl Acad Sci U S A. (1992) 89:584-588) for the human α₁S and α₁D cDNAs, respectively. The predicted polypeptide position of the encoded protein is designated. P=pore.
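The primer coordinates in Table 3 can be used to estimate the length of each expected PCR product. The following is a minimal sketch only, assuming that the span is taken as the difference between the 5′ positions of the reverse and forward primers on the reference cDNA (inclusive counting would add one nucleotide). The dictionary and function names are illustrative and are not part of the original disclosure. Only primer pairs for which both primers are listed in Table 3 are included.

```python
# 5' positions of the primers on the reference cDNAs, copied from Table 3.
# Pairs 3 x 5 and 10 x 11 are left out: their pairing cannot be read
# unambiguously from the table (oligonucleotide 11 is not listed).
PRIMER_5PRIME = {1: 3763, 2: 4796, 3: 4684, 4: 5845,
                 6: 3240, 7: 4220, 8: 2221, 9: 2622}
PAIRS = [(1, 2), (3, 4), (6, 7), (8, 9)]


def span_nt(forward, reverse):
    """Distance in nucleotides between the 5' ends of the two primers.

    Assumption: the span is the simple coordinate difference, which is how
    the 1033 nt riboprobe length follows from positions 3763 and 4796.
    """
    return PRIMER_5PRIME[reverse] - PRIMER_5PRIME[forward]


spans = {f"{f}x{r}": span_nt(f, r) for f, r in PAIRS}

for name, nt in spans.items():
    print(f"primer pair {name}: approximately {nt} nt")
```

Under this assumption the 1 × 2 pair spans 1033 nt, which agrees with the riboprobe length stated in paragraph [0325].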

Example 3 Antibodies and Immunoprecipitation

[0329] A rabbit antibody was generated against the synthetic peptide NEELRAIIKKIWKRTSMKLL (SEQ ID NO:27), which corresponds to the sequence aa 1487-1506 (arrow in FIG. 1A) in the putative intracytoplasmic carboxyl-terminal region of adult rabbit and human α₁S (Tanabe T., et al., Nature (1987) 328:313-318; Hogan, K., et al. Genomics (1994) 24:608-609). This sequence is common to many L-type calcium channel α₁ subunits from various species. A tetravalent multiple antigenic peptide was synthesized by the protein chemistry laboratories of UNC-CH and was used for immunization at 500 μg for the initial dose. A boosting dose of 100 μg of peptide was administered twice at 3 week intervals. Antibody response was detected by ELISA using the synthetic peptide as antigen and on Western blots using megakaryocytes solubilized with Laemmli sample buffer and fractionated by SDS-PAGE.
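As a simple consistency check, the length of the immunizing peptide can be compared with the residue range it is stated to span. The sketch below assumes inclusive residue numbering; the variable names are illustrative only.

```python
# Peptide sequence and residue range as given in the text (SEQ ID NO:27).
PEPTIDE = "NEELRAIIKKIWKRTSMKLL"
FIRST_RESIDUE, LAST_RESIDUE = 1487, 1506

# With inclusive numbering, a range a..b contains b - a + 1 residues.
expected_length = LAST_RESIDUE - FIRST_RESIDUE + 1

# Map each residue number in the range to its one-letter amino acid code.
numbered = {FIRST_RESIDUE + i: aa for i, aa in enumerate(PEPTIDE)}

print(f"peptide length: {len(PEPTIDE)} residues")
print(f"length implied by aa {FIRST_RESIDUE}-{LAST_RESIDUE}: {expected_length}")
print(f"first residue: {numbered[FIRST_RESIDUE]}, last residue: {numbered[LAST_RESIDUE]}")
```

The 20-residue peptide matches a 20-residue inclusive range, so the sequence and the numbering given in this paragraph are mutually consistent.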

[0330] Pig platelets were used at a concentration of 1.5×10¹¹/mL. They were isolated from 1 L of fresh whole blood, pelleted and washed twice in RPMI supplemented with a cocktail of protease inhibitors. Washed platelets were then lysed in a lysis buffer (Boehringer Mannheim of Indianapolis, Ind.) which contained 50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 1% Nonidet P40, 0.5% sodium deoxycholate and a cocktail of protease inhibitors (supplied by Boehringer Mannheim of Indianapolis, Ind.). The platelet lysate was transferred to a pre-chilled Dounce homogenizer and homogenized by approximately 10-15 repeated strokes using a type B pestle. After clearing by centrifugation at 12,000 × g, the supernate was preabsorbed onto a protein A-agarose suspension overnight at 4° C. The beads were discarded, and the supernate was incubated overnight at 4° C. with the purified rabbit anti-peptide IgG at a concentration of 200 μg/mL. Controls were incubated with a non-immune rabbit IgG at the same concentration (Dako Corp. of Carpinteria, Calif.).

[0331] The immune complexes were captured with 100 μg of protein A-agarose beads overnight at 4° C. After extensive washings, the immune complex was eluted with Laemmli sample buffer at 100° C. for 3-4 min, and the immunoprecipitated proteins were separated by SDS-PAGE. Western blots were performed after transfer of proteins onto nitrocellulose or PVDF membranes. The antipeptide antiserum was used as a primary antibody on Western blots at a dilution of 1:500. A goat antirabbit antibody labeled with alkaline phosphatase was used as a secondary antibody. The substrate for the immunoprecipitation reactions was provided in the electrochemiluminescence (ECL) reaction (Amersham Pharmacia Biotech of Piscataway, N.J.).

[0332] To confirm that the antipeptide antibody recognized α₁S and α₁D in human bone marrow megakaryocytes, it was incubated for 60 minutes at r. t. on human bone marrow smears. The antiserum was used in 1:1000 dilution. A fluorescent labeled goat antirabbit antibody (Jackson ImmunoResearch Laboratories, Inc., West Grove, Pa.) was used as a secondary antibody.

[0333] To preserve the antigenicity of the determinant recognized by our antipeptide antibody, human platelets were processed using the post-embedding method described by Madden, V. J., Microscp. Microanal. (1998) 4(Suppl. 2: proceedings):854-855. Fresh human platelets were allowed to settle and adhere on glass coverslips for ten minutes at r. t. Following gentle rinsing with PBS, the platelets were fixed in 2% paraformaldehyde, 0.5% glutaraldehyde in 0.1M sodium cacodylate, pH 7.4, and irradiated using a laboratory microwave oven (Ted Pella, Inc., Redding, Calif.). The cells were postfixed in 1% buffered osmium tetroxide for 10 minutes at r. t. They were dehydrated in acetone and infiltrated with L.R. White resin (London Resin Co., Ltd., Reading, England) using microwave irradiation. After polymerization with ultraviolet light (365 nm wavelength), the monolayers were sectioned en face at a thickness of 80 nm and mounted on 300 mesh nickel grids.

[0334] For immunocytochemical labeling, all steps were carried out at r. t. The anti-peptide antibody was diluted in 0.1M phosphate-buffered saline with globulin-free 0.1% bovine serum albumin, pH 7.4, and the secondary immunogold labeled antibody (Amersham Life Science Inc. of Arlington Heights, Ill.) was diluted in 0.1M Tris-buffered saline with 0.1% bovine serum albumin, pH 8.2. Grid-mounted sections were etched in 5% aqueous sodium metaperiodate for 30 min, then incubated in 0.2M glycine in PBS/BSA for 5 min. The sections were blocked in 5% normal goat serum in PBS/BSA for 10 min before incubation in the anti-peptide antibody (1:5 dilution) for 2 hours. After rinsing with PBS/BSA, the sections were incubated for 1 hour in goat anti-rabbit IgG 10 nm colloidal gold (1:25 dilution). Negative controls were performed concurrently by deleting primary antiserum and/or incubating in normal rabbit IgG at 1:5 dilution (Dako Corp. of Carpinteria, Calif.).

[0335] The immunogold-labeled sections were post-fixed in 1% glutaraldehyde in PBS followed by uranyl acetate and lead citrate treatment. The grids were observed and photographed using a LEO EM-910 transmission electron microscope (LEO Electron Microscopy, Thornwood, N.Y.) at 80 kV. Platelets were also prepared by conventional electron microscopy using 0.1% tannic acid in the fixative so as to obtain the optimal ultrastructural morphology of the surface-connected membranous open canalicular system.

Example 4 Two L-type VDCC α₁ Subunit cDNAs are Expressed in Platelets and Megakaryocytes

[0336] Reverse transcription-polymerase chain reaction (RT-PCR) was used to amplify, clone, and sequence flanking regions of VDCC α₁ subunit cDNA from cultured human megakaryocytes (Meg 01, ATCC) and from fresh porcine platelets. Two different isoforms were found to be expressed in human megakaryocytes. One exhibits sequence identity to the L-type VDCC α₁ subunit sequence from human skeletal muscle α₁S (Hogan, K., et al. Genomics (1994) 24:608-609), and is presented herein as SEQ ID NOs:1, 2, 5, and 28. The other exhibits sequence identity to the human neuro-endocrine cell α₁D sequence (Seino, S., et al., Proc Natl Acad Sci USA. (1992) 89:584-588 and FIGS. 1A and 1B), and is presented herein as SEQ ID NOs:3, 4, 6, and 29. These partial cDNAs encode regions between III S₄ and the carboxyl-end of α₁S, and between III S₅ and the carboxyl-end of α₁D, respectively (FIGS. 1A and 1B). Both cDNAs are missing nucleotides that encode a peptide in the putative extracellular linker between IV S₃ and IV S₄ (FIG. 1B). The intracytoplasmic loop between motifs II-III of the human α₁S was also sequenced (FIGS. 1A and 1B).

[0337] Similarly, a cDNA was obtained from circulating porcine platelets that corresponds to an encoded peptide in the region between IV S₃ and the amino acid 1531 of α₁S (FIGS. 1A and 1B). This platelet cDNA has 91% sequence identity with the rabbit α₁S skeletal muscle subunit (Tanabe T., et al., Nature (1987) 328:313-318), is presented herein as SEQ ID NOs:7 and 8 and demonstrates the same missing sequence as the above human sequence in the putative IV S₃ and IV S₄ linker (FIGS. 1A and 1B). Another porcine platelet cDNA which encodes the region between II S₃ and II S₆ of the α₁ subunit was sequenced. This partial porcine cDNA has 88% sequence identity to the human neuro-endocrine α₁D cDNA.
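Percent sequence identity values such as those quoted above are typically computed from a pairwise alignment. The following is a minimal sketch of one common convention, assuming the two sequences have already been aligned to equal length and that gapped columns are excluded from the denominator. The function name and the short toy sequences are hypothetical and are not taken from the sequence listing, and the alignment method actually used to obtain the 91% and 88% figures is not specified in the text.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the ungapped columns of a pairwise alignment.

    Assumption: identity = matching columns / columns with no gap in either
    sequence. Other conventions (for example, dividing by the full alignment
    length) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if x != gap and y != gap]
    if not columns:
        raise ValueError("no ungapped columns to compare")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


# Toy example: 12 aligned positions, 2 mismatches.
toy_a = "ATGGAGCCATCC"
toy_b = "ATGGAACCGTCC"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```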

[0338] The cloned cDNAs indicate that two different α₁ subunits of L-type voltage dependent calcium channels are expressed in platelets and their megakaryocytic precursors and that they encode polypeptides having sequence identity with the α₁ subunits of VDCCs from human skeletal muscle and neuro-endocrine cells. Importantly, the encoded polypeptides include regions with known active roles in VDCC function.

[0339] The cloned cDNAs predict the following structural and functional regions:

[0340] (a) pore lining segments in III S₅-S₆ and IV S₅-S₆ linkers (FIGS. 1A and 1B): VDCC pores are lined by the amphipathic loops in the S₅-S₆ linkers in each of the four motifs, where a glutamate residue in equivalent positions conveys calcium selectivity to the ion-conducting channel;

[0341] (b) voltage sensing segments in transmembrane regions III S₄ and IV S₄ where the positively charged residues in every third or fourth position sense depolarization and induce conformational changes responsible for channel gating;

[0342] (c) dihydropyridine binding sites in III S₆ and IV S₆ characteristic of L-type calcium channels (Striessnig J., et al., Trends Pharmacol Sci. (1998) 19:108-115); and

[0343] (d) an intracytoplasmic loop between II S₆ and III S₁ that is unique to α₁S and believed to be specific for the type of excitation-contraction coupling in skeletal muscle.
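The periodic spacing of positively charged residues described in item (b) can be illustrated on a short stretch of the translated sequence. The sketch below uses residues 161-176 of the translation printed with SEQ ID NO:1 in the sequence listing (Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val). Treating this stretch as an S₄-type voltage sensing segment is an assumption made here for illustration; the text does not assign these residues to a particular segment, and the variable names are hypothetical.

```python
# Residues 161-176 of the SEQ ID NO:1 translation, in one-letter code.
SEGMENT = "VKALRAFRVLRPLRLV"
START = 161  # residue number of the first letter in SEGMENT

# Residue numbers of the positively charged residues (Lys = K, Arg = R).
positions = [START + i for i, aa in enumerate(SEGMENT) if aa in "KR"]

# Distance between each consecutive pair of charged residues.
spacings = [later - earlier for earlier, later in zip(positions, positions[1:])]

print("charged residues at:", positions)
print("spacing between them:", spacings)
```

In this stretch a lysine or arginine recurs at every third position, which is the pattern item (b) associates with voltage sensing.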

[0344] The missing sequence in the IV S₃-S₄ linker in every cDNA cloned is envisioned to be a characteristic for the VDCC function in the platelets. A skipped exon has been detected in the same region in other L-type VDCCs (Peres-Reyes, E. and Schneider, T., Kidney Int. (1995) 48:1111-1124; Snutch, T. P., et al., Neuron (1991) 7:45-57; Diebold, R. J., et al., Proc Natl Acad Sci USA. (1992) 89:1497-1501). Isoform specific riboprobes that correspond to the nucleotides between 3763 and 4796 of the human α₁S and the corresponding sequence from α₁D (FIG. 1C) were synthesized as cmRNA and used to probe separate mRNA blots from human megakaryocytes. A 6.5 kb transcript hybridized with the α₁S specific cmRNA, and a 10.8 kb transcript hybridized with the α₁D specific cmRNA on Northern blots (FIG. 1D). The sizes of these transcripts are consistent with those published for α₁S and α₁D from skeletal muscle and neuroendocrine mRNA, respectively (Tanabe T., et al., Nature (1987) 328:313-318; Seino S., et al., Proc Natl Acad Sci U S A. (1992) 89:584-588).

Example 5 α₁S and α₁D VDCC Subunits are Expressed at the Protein Level in Platelets and Megakaryocytes

[0345] An antipeptide antibody was generated in a rabbit against the peptide sequence aa 1486-1506 (Tanabe T., et al., Nature (1987) 328:313-318; FIG. 1 arrow). This sequence is conserved in all published L-type VDCC α₁ subunits. This antibody recognizes its antigenic determinant in a 170 kDa polypeptide on Western blots from human megakaryocytes resolved by SDS-PAGE (FIG. 2A). This electrophoretic mobility is consistent with that of α₁S on SDS-PAGE previously described from skeletal muscle by Tanabe T., et al., Nature (1987) 328:313-318. That the 170 kDa polypeptide is indeed α₁S was confirmed with a monoclonal antibody (MAb 1A) previously published to recognize its determinant in α₁S from skeletal muscle (FIG. 2A), kindly provided by Dr. S. Froehner, Department of Cell Physiology, UNC-CH and described by Morton, M. E. and Froehner, S. C., J Biol Chem. (1987) 262:11904-11907.

[0346] Because α₁D was suspected to be expressed at a level below the detection threshold of direct Western blots, the subunits were enriched by immunoprecipitation using the anti-peptide antibody described in Example 3 above. Polypeptides with 208 kDa and 170 kDa electrophoretic mobilities were immunoprecipitated from porcine platelets. When the primary antibody in the immunoprecipitation reaction was replaced with non-immune IgG, these polypeptides were not present. See FIG. 2B.

[0347] That the anti-peptide antibody recognizes the 208 kDa polypeptide as α₁D was confirmed by Western blots on pancreatic β cells in culture resolved by SDS-PAGE, as shown in FIG. 2A. Pancreatic β cells in culture were kindly provided by Dr. Michael Freemark, Duke University Department of Pediatrics. The antibody recognized a determinant in a polypeptide with electrophoretic mobility of 208 kDa in the pancreatic β cells. MAb 1A did not recognize a determinant in these cells. The determinant recognized by MAb 1A has not been mapped and thus might be specific to α₁S.

[0348] Expression of the α₁ subunits in megakaryocytes in vivo was demonstrated on human bone marrow smears with the anti-peptide antibody as the primary antibody and an FITC-labeled anti-rabbit secondary antibody. The anti-peptide antibody recognized its determinant in a discrete speckled pattern on the megakaryocytes, which was interpreted to be α₁ subunits. As a negative control, the anti-peptide antibody was replaced with non-immune rabbit IgG. In the analysis, original magnification was 60×.

[0349] Thus, the polypeptide that has an electrophoretic mobility of 170 kDa corresponds to the 6.5 kb transcript identified on Northern blots and represents the α₁S subunit from human megakaryocytes and porcine platelets (SEQ ID NOs:1, 2, and 7-8, respectively). In contrast, the 208 kDa polypeptide corresponds to the 10.8 kb transcript on Northern blots and represents the α₁D subunit from human megakaryocytes and porcine platelets.

Example 6 The Platelet VDCC α₁ Subunit Polypeptide is Localized in a Tight Membranous Network of the Open Canalicular System

[0350] The site of probable function of the VDCC in the platelet was investigated at the ultrastructural level using the anti-peptide antibody to localize the antigenic determinant in unstimulated human platelets. An anti-rabbit secondary antibody labeled with 10 nm gold particles showed that the epitope recognized by the anti-peptide antibody was present in a tightly branching membranous network inside the platelets. This network forms a tortuous cribriform vesicular membranous system previously described by conventional electron microscopy to be continuous with the surface-connected open canalicular system (White, J. G., Am J Pathol. (1970) 58:31-49; White, J. G., Am J Pathol. (1972) 66:295-305).

[0351] In the examination of localization of VDCC, human platelets were post-embedded and incubated with the anti-peptide antibody as primary antibody. Ten nm gold particle labeled goat anti-rabbit antibody was the secondary antibody. Electron dense gold particles represented antigenic sites recognized by the anti-peptide antibody. This determinant was present in a membranous network continuous with the surface-connected open canalicular system. This network forms tightly branching vacuoles that extend deep inside the platelet. A sample was prepared by conventional EM to maintain the morphological integrity of the surface-connected open canalicular system. No gold particles were present when the anti-peptide antibody was replaced with normal rabbit IgG in a control experiment. In the analysis, original magnification was 16,000× and 500×.

Discussion of Examples

[0352] L-type VDCCs are generally co-localized with specialized intracellular organelles of which the function is calcium concentration dependent (Berridge, M. J., J Physiol (Lond). (1997) 499.2:291-306; Bokvist K., et al., EMBO J. (1995) 14:50-57; López-López, J. R., et al., Science (1995) 268:1042-1045; Shirokova, N., et al., J Physiol (Lond). (1998) 512.2:377-384). Activation of the VDCC in the cell membrane generates highly localized signals in the inner microenvironment that trigger responses strictly from the underlying organelles. Indeed, the L-type VDCC in striated muscle is preferentially localized in the t-tubule membrane, a sequestered surface membrane that invaginates from the cell membrane deep into the muscle cell. There, the VDCC is in close proximity with the calcium storing sarcoplasmic reticulum (SR) of muscle. Activation of the L-type VDCC in the t-tubule triggers the primary signal that causes release of Ca²⁺ from the SR lumen (Hille, B., Ionic channels of excitable membranes, Sunderland, Mass.: Sinauer Associates, Inc. Publishers (1992); López-López, J. R., et al., Science (1995) 268:1042-1045; Shirokova, N., et al., J Physiol (Lond). (1998) 512.2:377-384). This then generates the global Ca²⁺ signal that results in muscle contraction (Berridge, M. J., J Physiol (Lond). (1997) 499.2:291-306).

[0353] Similarly, the L-type VDCC is clustered in the cell membrane of insulin secreting β cells in a domain that is adjacent to the highest density of secretory granules inside the cell (Bokvist K., et al., EMBO J. (1995) 14:50-57). Activation of the L-type VDCCs in the cell membrane results in a localized intracellular Ca²⁺ burst that initiates exocytosis and release of insulin from these granules.

[0354] These morphological relationships in muscle and endocrine cells between the VDCC in surface membrane domains and specialized intracellular organelles appear structurally homologous to those observed in the platelet, as discussed herein above. The surface-connected open canalicular system allows the platelet cell membrane to extend deeply into the platelet. There, some of its components are juxtaposed with the dense tubular system, the calcium storing compartment of the platelet (White, J. G., Am J Pathol. (1970) 58:31-49). Others act as a conduit in the platelet secretory pathway (White, J. G., Am J Pathol. (1972)66:295-305). The finding of a preferential localization of the VDCC α₁ subunit in this membranous network implicates the VDCC at these sites in specific functions of the platelet, such as contraction and secretion.

[0355] That the L-type voltage dependent calcium channel is physiologically active in platelets is supported by a study on the effects of the dihydropyridine antagonists on platelet aggregation (Palés, J., et al., Biochem. et Biophys. Acta (1991) 1064:169-174). When used in a nanomolar range, these L-type calcium channel blockers inhibit platelet aggregation in a dose-dependent manner. Inhibition of Ca²⁺ channels at these submicromolar concentrations is believed to be selective for the L-type voltage dependent calcium channel (Lee, K. S., et al., Nature (1983) 302:790-794). Also, and of interest, is the unexplained finding that platelet aggregation time is prolonged in hypertensive patients receiving an L-type dihydropyridine calcium channel blocker for control of their hypertension (Sinzinger, H., et al., Eur J Clin Pharmacol. (1992) 42:43-46; Gebara, O. C., et al., Clin Cardiol. (1996) 19:205-211; Tison, P., et al., Am J Hypertens. (1994) 7:465-495).

[0356] Other studies, namely electrophysiological patch experiments, have not supported the role of VDCC in Ca²⁺ entry in the platelets (Mahaut-Smith, M. P., et al., J Biol Chem. (1992) 267:3060-3065), while other electrophysiologic studies have confirmed in phospholipid bilayers the presence of a calcium selective channel in platelet vesicles (Zschauer, A., et al., Nature (1988) 334:703-705). The preferential localization of the VDCC, as disclosed herein for the first time, in membranes deep inside the platelets could have masked these channels from patch experiments. Furthermore, the binding and antagonistic effects of dihydropyridines on the L-type VDCC are known to depend on the gating state of the channel. In the resting state this channel has a low affinity for these pharmacological agents (Bean, B. P., Proc Natl Acad Sci U S A. (1984) 81:6388-6392). This gating-dependent property could complicate the interpretation of results from experiments that use channel blockers on platelet aggregation and could explain some of the discrepancies reported on their effects on platelet function in different assays.

[0357] Tissue-specific expression of VDCC α₁ subunits confers a unique function to that tissue or organ. The expression of α₁D in neuroendocrine cells reflects the characteristics of the VDCC in excitation-secretion coupling in these cells. Similarly, the expression of α₁S in skeletal muscle (Birnbaumer L., et al., Neuron (1994) 13:505-506; Tanabe T., et al., Nature (1987) 328:313-318) or α₁C in cardiac muscle (Birnbaumer L., et al., Neuron (1994) 13:505-506; Mikami, A., et al., Nature (1989) 340:230-233) reflects the characteristics of the corresponding VDCC in excitation-contraction coupling in skeletal (Shirokova, N., et al., J Physiol (Lond). (1998) 512.2:377-384) and cardiac (López-López, J. R., et al., Science (1995) 268:1042-1045) muscles respectively. Extrapolating the expression of α₁D and α₁S subunits to the function of the platelet, the corresponding channels are implicated in excitation-secretion and excitation-contraction couplings in platelets.

REFERENCES

[0358] The publications and other materials listed below and/or set forth by author and date in the text above to illuminate the background of the invention, and in particular cases, to provide additional details respecting the practice, are incorporated herein by reference. Materials used herein include but are not limited to the following listed references.

[0359] Abbrachio, M. P., et al., Pharmacol Ther. (1994) 64:445-475.

[0360] Adelman et al., DNA 2:183 (1983).

[0361] Ahuja et al., Cancer Res. (1998) 23:5489-94.

[0362] Armstrong, C. M. and Hille, B., Neuron (1998) 20:371-380.

[0363] Ashcroft, F. M., Ion channels and disease. San Diego, Calif.: Academic Press; (2000).

[0364] Ausubel et al. (1992) Current Protocols in Molecular Biology, (J. Wiley & Sons, N.Y.)

[0365] Bean, B. P. Proc Natl Acad Sci USA. (1984) 81:6388-6392.

[0366] Bean, B. P., Trends Phys Sci. (1993) 13:87-90.

[0367] Belinsky et al., Proc. Natl. Acad. Sci. USA (1998) 20:11891-6.

[0368] Berridge, M. J., J Physiol (Lond). (1997) 499.2:291-306.

[0369] Bhenke, O. (1992) J. Submicroscopic Cytology and Pathology 24:169-178.

[0370] Birnbaumer L., et al., Neuron (1994) 13:505-506.

[0371] Bitgood et al. (1996) Curr. Biol. 6, 298-304.

[0372] Bodanszky, et al., “Peptide Synthesis”, John Wiley & Sons, Second Edition, 1976.

[0373] Bokvist K., et al., EMBO J. (1995) 14:50-57.

[0374] Boyd, A. E., J Cell Biochem. (1992) 48:234-261.

[0375] Crabtree, G. and Henderson, J. (1971) Cancer Res. 31:985-991.

[0376] Crea et al. (1978) Proc. Natl. Acad. Sci. U.S.A, 75:5765.

[0377] Diebold, R. J., et al., Proc Natl Acad Sci U S A. (1992) 89:1497-1501.

[0378] Eichenlaub et al. (1979) J. Bacteriol 138:559-566.

[0379] Fields et al., Int. J. Peptide Protein Res., 35:161-214, 1990

[0380] Fischer, T., et al. (2000) Brit. J. Haem. (in press)

[0381] Gebara, O. C., et al., Clin Cardiol. (1996) 19:205-211.

[0382] Gribskov et al., Nucl. Acids. Res. 14:6745 (1986).

[0383] Hiki et al. (1999) J Biol Chem 274(15):10661-7.

[0384] Hille, B., Ionic channels of excitable membranes, Sunderland, Mass.: Sinauer Associates, Inc. Publishers (1992).

[0385] Hogan, K., et al. Genomics (1994) 24:608-609.

[0386] Holmsen, H. (1972) Ann. N.Y. Acad. Sci 201:109-115.

[0387] Holtzman, E. J., et al., (1998) Am. J. Physiol. 275:F550-F564

[0388] Hopp, U.S. Pat. No. 4,554,101.

[0389] Harlow et al. (1988) Antibodies A Laboratory Manual, (Cold Spring Harbor Laboratory).

[0390] Jin, J. and Kanapuli S., Proc Natl Acad Sci U S A. (1998) 95:8070-8074.

[0391] Kanapuli, S., Trends Pharmacol Sci. (1998) 19:391-394.

[0392] Khandelwal, G., et al. (1997) FASEB J. 11:1812.

[0393] Kyte et al. (1982) J. Mol. Biol. 157:105.

[0394] Lee, K. S., et al., Nature (1983) 302:790-794.

[0395] López-López, J. R., et al., Science (1995) 268:1042-1045.

[0396] Mackenzie, A. B., et al., J Biol Chem. (1996) 271:2879-2881.

[0397] Madden, V. J., Microscp. Microanal. (1998) 4(Suppl. 2: proceedings):854-855.

[0398] Mahaut-Smith, M. P., et al., J Biol Chem. (1992) 267:3060-3065.

[0399] Makalowski, W., and Boguski, M. S. (1998) Proc. Natl. Acad. Sci. USA 95:9407-9412.

[0400] Malouf, N. N., et al. (1987) Proc. Natl. Acad. Sci. USA 84:5019-5023.

[0401] Maniatis et al. (1978) Cell 15:687-701.

[0402] McOmie, Protective Groups in Organic Chemistry, Plenum Press, New York, (1973)

[0403] Meienhofer, Hormonal Proteins and Peptides, Vol.2, p. 46, Academic Press, New York, (1983)

[0404] Merricks, E., et al. (1998) Blood 10(S1-2):71b.

[0405] Merrifield (1969) Adv. Enzymol 32:221-96.

[0406] Messing et al. (1981) Third Cleveland Symposium on Macromolecules and Recombinant DNA, Editor A. Walton, (Elsevier, Amsterdam).

[0407] Mikami, A., et al., Nature (1989) 340:230-233.

[0408] Mills, D. C., Thromb Haemost. (1996) 76:835-856.

[0409] Morton, M. E. and Froehner, S. C., J Biol Chem. (1987) 262:11904-11907.

[0410] Needleman et al. (1970) J. Mol. Biol. 48:443.

[0411] Novak, E. and Rabinovitch, P. (1994) Cytometry 17:135-141.

[0412] Ochman et al. (1990) Amplification of flanking sequences by Inverse PCR, in PCR protocols: a Guide to Methods and Applications (Innis et al., eds.) pp. 219-227. Academic Press, San Diego, Calif.

[0413] Palés, J., et al., Biochem. et Biophys. Acta (1991) 1064:169-174.

[0414] Peres-Reyes, E. and Schneider, T., Kidney Int. (1995) 48:1111-1124.

[0415] Sage, S. O., Exp Physiol. (1997) 82:807-823.

[0416] Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).

[0417] Sanders, W., et al. (1996) Blood 88:S107.

[0418] Schroder et al., “The Peptides”, Vol.1, Academic Press (New York) (1965).

[0419] Schwartz et al., eds., Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, pp. 357-358 (1979).

[0420] Seino, S., et al., Proc Natl Acad Sci U S A. (1992) 89:584-588.

[0421] Shirokova, N., et al., J Physiol (Lond). (1998) 512.2:377-384.

[0422] Sinzinger, H., et al., Eur J Clin Pharmacol. (1992) 42:43-46.

[0423] Smith et al., Adv. Appl. Math. 2:482 (1981).

[0424] Snutch, T. P., et al., Neuron (1991) 7:45-57.

[0425] Steward et al. (1969) Solid Phase Peptide Synthesis, W. H. Freeman Co., San Francisco.

[0426] Striessnig, J., et al., Trends Pharmacol Sci. (1998) 19:108-115.

[0427] Stühmer, et al., Nature (1989) 339:597-603.

[0428] Sun B., et al. J Biol Chem. (1998) 273:11544-11547.

[0429] Surprenant A., Trends Neurosci. (1995) 18:224-229.

[0430] Tanabe T., et al., Nature (1987) 328:313-318.

[0431] Tanabe T., et al., Nature (1990) 346:567-569.

[0432] Tison, P., et al., Am J Hypertens. (1994) 7:465-495.

[0433] U.S. Pat. No. 5,614,396

[0434] U.S. Pat. No. 5,589,375

[0435] U.S. Pat. No. 5,624,816

[0436] U.S. Pat. No. 5,583,103

[0437] U.S. Pat. No. 5,580,979

[0438] U.S. Pat. No. 5,734,033

[0439] U.S. Pat. No. 5,723,593

[0440] U.S. Pat. No. 5,739,278

[0441] U.S. Pat. No. 5,753,687

[0442] U.S. Pat. No. 5,641,484

[0443] U.S. Pat. No. 5,693,488

[0444] U.S. Pat. No. 5,399,346

[0445] U.S. Pat. No. 5,352,660

[0446] U.S. Pat. No. 5,326,902

[0447] U.S. Pat. No. 5,286,634

[0448] U.S. Pat. No. 5,279,833

[0449] U.S. Pat. No. 5,234,933

[0450] U.S. Pat. No. 5,162,215

[0451] U.S. Pat. No. 5,741,957

[0452] U.S. Pat. No. 5,120,535

[0453] U.S. Pat. No. 4,769,331

[0454] U.S. Pat. No. 4,736,866

[0455] U.S. Pat. No. 4,686,283

[0456] U.S. Pat. No. 5,651,964

[0457] U.S. Pat. No. 5,573,933

[0458] U.S. Pat. No. 5,550,316

[0459] U.S. Pat. No. 5,625,125

[0460] U.S. Pat. No. 4,683,202

[0461] U.S. Pat. No. 4,244,946

[0462] U.S. Pat. No. 5,643,567

[0463] U.S. Pat. No. 4,196,265

[0464] U.S. Pat. No. 3,095,355

[0465] U.S. Pat. No. 2,868,691

[0466] U.S. Pat. No. 5,489,742

[0467] U.S. Pat. No. 5,436,288

[0468] U.S. Pat. No. 5,627,158

[0469] U.S. Pat. No. 5,648,061

[0470] U.S. Pat. No. 5,646,008

[0471] U.S. Pat. No. 5,837,479

[0472] U.S. Pat. No. 5,786,152

[0473] U.S. Pat. No. 5,776,902

[0474] U.S. Pat. No. 5,645,999

[0475] U.S. Pat. No. 5,770,609

[0476] U.S. Pat. No. 5,780,436

[0477] Wetmur & Davidson (1968) J. Mol. Biol. 31:349-370.

[0478] White, J. G., Am J Pathol. (1970) 58:31-49.

[0479] White, J. G., Am J Pathol. (1972) 66:295-305.

[0480] White, T. E., et al. (1990) Proc. Natl. Acad. Sci. 87:758-762.

[0481] WO 93/25521

[0482] WO 96/40276

[0483] Yang J., et al., Nature (1993) 366:158-161.

[0484] Zimmer et al., Peptides 1992, pp. 393-394, ESCOM Science Publishers, B. V. 1993.

[0485] Zschauer, A., et al., Nature (1988) 334:703-705.

[0486] It will be understood that various details of the invention can be changed without departing from the scope of the invention. Furthermore, the foregoing description is for the purpose of illustration only, and not for the purpose of limitation—the invention being defined by the claims.

1 29 1 5565 DNA Homo sapiens CDS (1)..(5565) 1 atg gag cca tcc tca ccc cag gat gaa ggc ctg agg aag aaa cag ccc 48 Met Glu Pro Ser Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro 1 5 10 15 aag aag cca gtt cct gag att ctg cca agg cca ccc cgg gct ttg ttc 96 Lys Lys Pro Val Pro Glu Ile Leu Pro Arg Pro Pro Arg Ala Leu Phe 20 25 30 tgc ctg acc ctg gag aac ccc ctg agg aag gcc tgc atc agc att gta 144 Cys Leu Thr Leu Glu Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val 35 40 45 gaa tgg aag ccc ttc gag acg atc atc ttg ctc acc atc ttt gcc aat 192 Glu Trp Lys Pro Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn 50 55 60 tgt gtg gcc ctg gcc gtg tac ctg ccc atg ccg gaa gat gac aac aac 240 Cys Val Ala Leu Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn 65 70 75 80 tct ctg aac ctc ggc ctg gag aag ctg gag tat ttc ttc ctc att gtc 288 Ser Leu Asn Leu Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Ile Val 85 90 95 ttc tcg att gaa gcc gcc atg aag atc att gcc tac ggc ttc tta ttc 336 Phe Ser Ile Glu Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe 100 105 110 cac cag gac gct tac ctg cgc agt ggc tgg aat gtg ctg gac ttc acc 384 His Gln Asp Ala Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Thr 115 120 125 att gtc ttc ctg ggg gtc ttc acc gtg att ctg gaa cag gtt aac gtc 432 Ile Val Phe Leu Gly Val Phe Thr Val Ile Leu Glu Gln Val Asn Val 130 135 140 atc caa agc cac aca gcc cca atg agc agc aaa gga gcc ggc ttg gat 480 Ile Gln Ser His Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp 145 150 155 160 gtc aag gcc ctc aga gcc ttc cga gtg ctc aga ccc ctc cgg ctg gtg 528 Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val 165 170 175 tcg ggg gtg cct agc ctg cag gtg gtc ctg aac tcc atc ttc aag gcc 576 Ser Gly Val Pro Ser Leu Gln Val Val Leu Asn Ser Ile Phe Lys Ala 180 185 190 atg ctc ccc ctc ttt cac atc gcc ctg ctg gtc ctc ttt atg gtc atc 624 Met Leu Pro Leu Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile 195 200 205 atc tat gcc atc atc ggg ctg gag ctc ttc aag ggc aag atg cac aag 672 Ile Tyr Ala Ile Ile Gly 
Leu Glu Leu Phe Lys Gly Lys Met His Lys 210 215 220 acc tgc tac ttc att ggt aca gat atc gtg gcc acg gtg gag aat gaa 720 Thr Cys Tyr Phe Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu 225 230 235 240 gag cca tcg ccc tgc gcc agg acg ggc tca ggg cgc cgg tgc acc atc 768 Glu Pro Ser Pro Cys Ala Arg Thr Gly Ser Gly Arg Arg Cys Thr Ile 245 250 255 aat ggc agt gag tgc cgg ggc ggc tgc cca ggg ccc aac cat ggc atc 816 Asn Gly Ser Glu Cys Arg Gly Gly Cys Pro Gly Pro Asn His Gly Ile 260 265 270 acc cac ttc gac aac ttc ggc ttc tcc atg ctc acc gtg tac cag tgc 864 Thr His Phe Asp Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys 275 280 285 att acc atg gag gga tgg act gac gtc ctt tac tgg gtc aat gat gcc 912 Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala 290 295 300 atc ggg aat gag tgg ccc tgg atc tat ttt gtc acc ctc att ttg ctg 960 Ile Gly Asn Glu Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu 305 310 315 320 gga tcc ttc ttc atc ctc aac ctg gtg ctg ggt gtc ctg agt ggg gaa 1008 Gly Ser Phe Phe Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu 325 330 335 ttc acc aag gag cgg gag aag gcc aag tcc agg gga acc ttc cag aag 1056 Phe Thr Lys Glu Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys 340 345 350 ctc cgg gag aag cag caa cta gat gag gac ctt cgg ggc tac atg agc 1104 Leu Arg Glu Lys Gln Gln Leu Asp Glu Asp Leu Arg Gly Tyr Met Ser 355 360 365 tgg atc acg cag ggc gag gtc atg gat gtt gag gac ttc aga gaa gga 1152 Trp Ile Thr Gln Gly Glu Val Met Asp Val Glu Asp Phe Arg Glu Gly 370 375 380 aaa ctg tct ttg gat gaa ggt ggc tct gac aca gag agc ctg tat gaa 1200 Lys Leu Ser Leu Asp Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu 385 390 395 400 att gca ggc ttg aac aaa atc atc cag ttc atc cga cat tgg agg cag 1248 Ile Ala Gly Leu Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln 405 410 415 tgg aac cgc atc ttt cgc tgg aag tgc cat gac atc gtg aag tcc aag 1296 Trp Asn Arg Ile Phe Arg Trp Lys Cys His Asp Ile Val Lys Ser Lys 420 425 430 gtc ttc tat tgg ctg gtg att ctc atc gtt gcc ctc aac acc 
ctg tct 1344 Val Phe Tyr Trp Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser 435 440 445 atc gcc tca gag cac cac aac cag ccg cac tgg ctg acc cgt ttg caa 1392 Ile Ala Ser Glu His His Asn Gln Pro His Trp Leu Thr Arg Leu Gln 450 455 460 gac att gcc aac cgg gtg ctg ctg tcc ctc ttc acc act gag atg ctg 1440 Asp Ile Ala Asn Arg Val Leu Leu Ser Leu Phe Thr Thr Glu Met Leu 465 470 475 480 atg aag atg tac ggg ctg ggc ctg cgc cag tac ttc atg tct atc ttc 1488 Met Lys Met Tyr Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe 485 490 495 aac cgc ttc gac tgc ttc gtg gtg tgc agc ggt atc ctg gag atc ctg 1536 Asn Arg Phe Asp Cys Phe Val Val Cys Ser Gly Ile Leu Glu Ile Leu 500 505 510 ctg gtg gag tcg ggc gcc atg aca ccc ctg ggc atc tcc gtg ctc cgc 1584 Leu Val Glu Ser Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg 515 520 525 tgc atc cgc ctc ctg agg atc ttc aag atc acc aaa tat tgg acg tcg 1632 Cys Ile Arg Leu Leu Arg Ile Phe Lys Ile Thr Lys Tyr Trp Thr Ser 530 535 540 ctg agc aac ctg gtg gca tcc ctg ctc aac tcc atc cgc tcc atc gcc 1680 Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala 545 550 555 560 tcc ctg ctg ctg ctg ctc ttc ctc ttc atc gtc atc ttc cgc ctc ctg 1728 Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Val Ile Phe Arg Leu Leu 565 570 575 ggc atg cag ctc ttt ggg ggg agg tat gac ttt gaa gac aca gaa gta 1776 Gly Met Gln Leu Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val 580 585 590 cgg cgc agc aac ttt gac aac ttt ccc caa gcc ctc atc agc gtc ttc 1824 Arg Arg Ser Asn Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe 595 600 605 cag gta ctg aca ggg gaa gac tgg acc tca atg atg tac aat ggg atc 1872 Gln Val Leu Thr Gly Glu Asp Trp Thr Ser Met Met Tyr Asn Gly Ile 610 615 620 atg gcc tcg agc ggg ccg tcc tac cct ggc atg ctt gtg tgc att tac 1920 Met Ala Ser Ser Gly Pro Ser Tyr Pro Gly Met Leu Val Cys Ile Tyr 625 630 635 640 ttc atc atc ctt ttc gtc tgt ggc aac tac atc ctg ctc aat gtc ttc 1968 Phe Ile Ile Leu Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe 645 650 655 ctg gcc att 
gcc gtg gac aac ctg gcc gag gcg gag agc ctg act tct 2016 Leu Ala Ile Ala Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser 660 665 670 gcc cag aag gcc aag gct gag gag aaa aaa cgc agg aag atg tcc aag 2064 Ala Gln Lys Ala Lys Ala Glu Glu Lys Lys Arg Arg Lys Met Ser Lys 675 680 685 ggt ctc cca gac aag tca gaa gag gag aag tca acg atg gcc aag aag 2112 Gly Leu Pro Asp Lys Ser Glu Glu Glu Lys Ser Thr Met Ala Lys Lys 690 695 700 ctg gag cag aaa ccc aag ggt gag ggc atc ccc acc act gcc aag ctg 2160 Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu 705 710 715 720 aaa atc gat gag ttt gaa tct aat gtc aat gag gtg aag gat ccc tac 2208 Lys Ile Asp Glu Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr 725 730 735 ccc tca gcc gac ttc cca ggg gat gac gag gaa gat gag cct gag atc 2256 Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile 740 745 750 ccg ctg agc ccc cga cca cgt ccc ctg gct gag ctg cag ctg aaa gag 2304 Pro Leu Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu 755 760 765 aag gcc gtg ccc att cca gaa gcc agc tcc ttc ttc atc ttc agc ccc 2352 Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro 770 775 780 acc aat aag atc cgt gtc ctg tgt cac cgc atc gtc aat gcc acc tgg 2400 Thr Asn Lys Ile Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp 785 790 795 800 ttc acc aac ttc atc ctg ctc ttc atc ctg ctc agc agc gct gca ctg 2448 Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu 805 810 815 gct gcg gaa gac ccc atc cgg gct gat tcc atg aga aat cag atc ctt 2496 Ala Ala Glu Asp Pro Ile Arg Ala Asp Ser Met Arg Asn Gln Ile Leu 820 825 830 aaa cac ttt gac atc ggg ttc acc tct gtc ttc act gtg gag att gtc 2544 Lys His Phe Asp Ile Gly Phe Thr Ser Val Phe Thr Val Glu Ile Val 835 840 845 ctc aag atg acg acc tac gga gcc ttc ctg cac aag ggt tcc ttc tgc 2592 Leu Lys Met Thr Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys 850 855 860 cgc aat tac ttc aac atg ctg gac ctg ctg gtg gtg gcc gtg tcc ctc 2640 Arg Asn Tyr Phe Asn Met Leu Asp Leu Leu Val Val 
Ala Val Ser Leu 865 870 875 880 atc tcc atg gga ctt gag tcc agt gcc atc tcc gtg gtg aag atc ctg 2688 Ile Ser Met Gly Leu Glu Ser Ser Ala Ile Ser Val Val Lys Ile Leu 885 890 895 agg gtg ctg agg gtg ctc cga cca ctc aga gcc atc aac aga gcc aag 2736 Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys 900 905 910 ggg ttg aag cac gtg gct agg tgc atg ttc gtg gcc atc agc acc atc 2784 Gly Leu Lys His Val Ala Arg Cys Met Phe Val Ala Ile Ser Thr Ile 915 920 925 ggg aac atc gtg ctg gtc act acc ctc cta cag ttc atg ttt gcc tgc 2832 Gly Asn Ile Val Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys 930 935 940 atc ggc gtc cag ctc ttc aag ggg aag ttc ttc agg tgc acc gac ttg 2880 Ile Gly Val Gln Leu Phe Lys Gly Lys Phe Phe Arg Cys Thr Asp Leu 945 950 955 960 tcc aag atg aca gag gag gag tgc agg ggc tac tac tac gtg tac aag 2928 Ser Lys Met Thr Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys 965 970 975 gac ggg gac ccc atg cag ata gag ctg cgt cac cgc gag tgg gta cac 2976 Asp Gly Asp Pro Met Gln Ile Glu Leu Arg His Arg Glu Trp Val His 980 985 990 agc gac ttc cac ttc gac aat gtg ctc tca gcc atg atg tcc ctc ttc 3024 Ser Asp Phe His Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe 995 1000 1005 acg gtc tcc acc ttc gag gga tgg cct cag ctg ctg tac aag gcc 3069 Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Lys Ala 1010 1015 1020 ata gac tcc aat gcg gag gac gtg ggt ccc atc tac aac aac cgt 3114 Ile Asp Ser Asn Ala Glu Asp Val Gly Pro Ile Tyr Asn Asn Arg 1025 1030 1035 gtg gag atg gcc atc ttc ttc atc atc tac atc atc ctc att gcc 3159 Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala 1040 1045 1050 ttc ttc atg atg aac atc ttt gtg ggc ttc gtc att gtc acc ttc 3204 Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe 1055 1060 1065 cag gag cag gga gag act gag tac aag aac tgt gag ctg gac aag 3249 Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 1070 1075 1080 aac cag cgc caa tgt gta cag tat gcc ctg aag gcc cgc cca ctg 3294 Asn Gln Arg Gln Cys Val Gln Tyr Ala 
Leu Lys Ala Arg Pro Leu 1085 1090 1095 agg tgc tac att ccc aaa aac cca tac cag tac cag gtg tgg tac 3339 Arg Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr 1100 1105 1110 att gtc acc tcc tcc tac ttt gaa tac ctg atg ttt gcc ctc atc 3384 Ile Val Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile 1115 1120 1125 atg ctc aac acc atc tgc ctc ggc atg cag cac tac aac cag tcg 3429 Met Leu Asn Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser 1130 1135 1140 gag cag atg aac cac atc tca gac atc ctc aat gtg gcc ttc act 3474 Glu Gln Met Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr 1145 1150 1155 atc atc ttc acc ctg gag atg atc ctc aag ctc atg gcc ttc aag 3519 Ile Ile Phe Thr Leu Glu Met Ile Leu Lys Leu Met Ala Phe Lys 1160 1165 1170 gcc agg ggc tac ttt gga aac ccc tgg aat gtg ttt gac ttc ctg 3564 Ala Arg Gly Tyr Phe Gly Asn Pro Trp Asn Val Phe Asp Phe Leu 1175 1180 1185 att gtc att ggc agc atc att gat gtc atc ctc agt gag atc gac 3609 Ile Val Ile Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp 1190 1195 1200 gac cca gat gag agt gcc cgc atc tcc agc gcc ttc ttc cgc ctg 3654 Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala Phe Phe Arg Leu 1205 1210 1215 ttc cgt gtc atg agg ctg atc aag ctg ctg agc cgg gca gaa gga 3699 Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser Arg Ala Glu Gly 1220 1225 1230 gtg cga acc ctc ctg tgg acg ttc atc aag tcc ttc cag gcc cta 3744 Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala Leu 1235 1240 1245 ccc tac gtg gct ctg ctc atc gtc atg ctc ttc ttc atc tac gct 3789 Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe Phe Ile Tyr Ala 1250 1255 1260 gtc atc ggc atg cag atg ttt ggg aag atc gcc ttg gtg gat ggg 3834 Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala Leu Val Asp Gly 1265 1270 1275 acc caa ata aac cgg aac aac aac ttc cag acc ttc cca caa gct 3879 Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln Ala 1280 1285 1290 gtg cta ctg ctc ttc agg tgt gca aca ggt gag gcc tgg cag gag 3924 Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp Gln Glu 1295 1300 1305 
atc cta ctg gcc tgc agc tat ggg aag ctg tgt gac cca gag tcg 3969 Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp Pro Glu Ser 1310 1315 1320 gac tat gcc cca ggg gag gag tac aca tgt ggc acc aac ttt gca 4014 Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn Phe Ala 1325 1330 1335 tac tac tac ttc atc agc ttc tac atg ctc tgt gcc ttc ctg gtc 4059 Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu Val 1340 1345 1350 atc aac ctc ttt gtg gct gtc atc atg gac aat ttt gac tac ctc 4104 Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr Leu 1355 1360 1365 acc cgg gac tgg tcc atc ctg ggc cct cat cac ctg gat gag ttc 4149 Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu Phe 1370 1375 1380 aag gcc atc tgg gca gag tat gac cca gag gct aag ggg agg atc 4194 Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg Ile 1385 1390 1395 aaa cac ctg gac gtg gtg acc ctg ctg aga agg att cag ccc cct 4239 Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro Pro 1400 1405 1410 ctg ggc ttt ggg aag ttc tgc cca cat cgg gta gct tgt aag cgg 4284 Leu Gly Phe Gly Lys Phe Cys Pro His Arg Val Ala Cys Lys Arg 1415 1420 1425 ctg gtg ggc atg aac atg ccc ctg aac agc gac ggc aca gtc acc 4329 Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val Thr 1430 1435 1440 ttc aat gcc aca ctc ttt gcc ctg gtc cgc acg gca ctc aag atc 4374 Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys Ile 1445 1450 1455 aag acg gaa ggt aac ttt gag cag gcc aac gag gag ctg agg gcc 4419 Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu Glu Leu Arg Ala 1460 1465 1470 atc atc aag aag atc tgg aag aga acc agc atg aag ctc ttg gac 4464 Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met Lys Leu Leu Asp 1475 1480 1485 cag gtc atc cct cca ata gga gat gat gag gtg aca gtg ggg aag 4509 Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val Thr Val Gly Lys 1490 1495 1500 ttc tac gcc aca ttc ctc atc cag gag cac ttc cgg aag ttc atg 4554 Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe Arg Lys Phe Met 1505 1510 1515 aaa cgc caa gag gag tat tat ggc tat cgg 
ccc aag aag gac att 4599 Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro Lys Lys Asp Ile 1520 1525 1530 gta cag atc cag gca ggg ctg cgg acc att gag gaa gag gca gcc 4644 Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu Glu Glu Ala Ala 1535 1540 1545 ccc gag atc tgt cgc acg gtc tca gga gac ctg gct gct gag gag 4689 Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu Ala Ala Glu Glu 1550 1555 1560 gag ctg gag aga gcc atg gtg gag gct gcg atg gag gag ggg ata 4734 Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu Gly Ile 1565 1570 1575 ttc cgg agg act gga ggc ctg ttt ggc cag gtg gac aac ttc ctg 4779 Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Asn Phe Leu 1580 1585 1590 gaa agg acc aac tcc ctg ccc cct gtc atg gcc aat cag aga ccc 4824 Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg Pro 1595 1600 1605 ctc cag ttt gct gag ata gag atg gaa gag atg gag tca cct gtc 4869 Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met Glu Ser Pro Val 1610 1615 1620 ttc ttg gag gac ttc cca caa gat cca cgc acc aac ccc ctg gct 4914 Phe Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr Asn Pro Leu Ala 1625 1630 1635 cgt gcc aat acc aac aat gcc aac gcc aat gtc gcc tat gcg aac 4959 Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Ala Asn 1640 1645 1650 agc aac cat agc aac agc cat gtg ttt tcc agt gtc cac tat gaa 5004 Ser Asn His Ser Asn Ser His Val Phe Ser Ser Val His Tyr Glu 1655 1660 1665 agg gag ttc cca gaa gag aca gag acg cct gct acc aga gga cga 5049 Arg Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala Thr Arg Gly Arg 1670 1675 1680 gcc ctt ggc caa ccc tgc agg tcc ctg gga ccc cac agc aaa ccc 5094 Ala Leu Gly Gln Pro Cys Arg Ser Leu Gly Pro His Ser Lys Pro 1685 1690 1695 tgt gtg gag atg ctg aag gga ctg ctg acc cag agg gca atg ccc 5139 Cys Val Glu Met Leu Lys Gly Leu Leu Thr Gln Arg Ala Met Pro 1700 1705 1710 aga ggc cag gca cct cct gcc ccc tgc cag tgc ccc agg gtg gag 5184 Arg Gly Gln Ala Pro Pro Ala Pro Cys Gln Cys Pro Arg Val Glu 1715 1720 1725 tcc tcc atg cct gag gac aga aag agc tcc aca cca ggg tct ctt 5229 Ser Ser Met 
Pro Glu Asp Arg Lys Ser Ser Thr Pro Gly Ser Leu 1730 1735 1740 cat gag gag aca ccc cac agc agg agc acc agg gag aat act tcc 5274 His Glu Glu Thr Pro His Ser Arg Ser Thr Arg Glu Asn Thr Ser 1745 1750 1755 agg tgc tca gca cca gct aca gcc ctg ctg atc caa aag gct ctg 5319 Arg Cys Ser Ala Pro Ala Thr Ala Leu Leu Ile Gln Lys Ala Leu 1760 1765 1770 gtt cga ggg ggc ctg ggc acc ttg gca gct gat gca aac ttc atc 5364 Val Arg Gly Gly Leu Gly Thr Leu Ala Ala Asp Ala Asn Phe Ile 1775 1780 1785 atg gca aca ggc cag gcc ctc gga gat gcc tgc caa atg gaa cca 5409 Met Ala Thr Gly Gln Ala Leu Gly Asp Ala Cys Gln Met Glu Pro 1790 1795 1800 gag gaa gtg gag atc atg gca aca gag cta ctg aaa gga cga gag 5454 Glu Glu Val Glu Ile Met Ala Thr Glu Leu Leu Lys Gly Arg Glu 1805 1810 1815 gcc cca gac ggc atg gcc agc tcc ctg gga tgc ctg aac ctc ggg 5499 Ala Pro Asp Gly Met Ala Ser Ser Leu Gly Cys Leu Asn Leu Gly 1820 1825 1830 tcc tcc ctg ggc agc ctc gac caa cac cag ggc tcc cag gag acc 5544 Ser Ser Leu Gly Ser Leu Asp Gln His Gln Gly Ser Gln Glu Thr 1835 1840 1845 ctt att cct cca agg ctg tga 5565 Leu Ile Pro Pro Arg Leu 1850

2 1854 PRT Homo sapiens 2
Met Glu Pro Ser Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro 1 5 10 15 Lys Lys Pro Val Pro Glu Ile Leu Pro Arg Pro Pro Arg Ala Leu Phe 20 25 30 Cys Leu Thr Leu Glu Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val 35 40 45 Glu Trp Lys Pro Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn 50 55 60 Cys Val Ala Leu Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn 65 70 75 80 Ser Leu Asn Leu Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Ile Val 85 90 95 Phe Ser Ile Glu Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe 100 105 110 His Gln Asp Ala Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Thr 115 120 125 Ile Val Phe Leu Gly Val Phe Thr Val Ile Leu Glu Gln Val Asn Val 130 135 140 Ile Gln Ser His Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp 145 150 155 160 Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val 165 170 175 Ser Gly Val Pro Ser Leu Gln Val Val Leu 
Asn Ser Ile Phe Lys Ala 180 185 190 Met Leu Pro Leu Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile 195 200 205 Ile Tyr Ala Ile Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys 210 215 220 Thr Cys Tyr Phe Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu 225 230 235 240 Glu Pro Ser Pro Cys Ala Arg Thr Gly Ser Gly Arg Arg Cys Thr Ile 245 250 255 Asn Gly Ser Glu Cys Arg Gly Gly Cys Pro Gly Pro Asn His Gly Ile 260 265 270 Thr His Phe Asp Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys 275 280 285 Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala 290 295 300 Ile Gly Asn Glu Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu 305 310 315 320 Gly Ser Phe Phe Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu 325 330 335 Phe Thr Lys Glu Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys 340 345 350 Leu Arg Glu Lys Gln Gln Leu Asp Glu Asp Leu Arg Gly Tyr Met Ser 355 360 365 Trp Ile Thr Gln Gly Glu Val Met Asp Val Glu Asp Phe Arg Glu Gly 370 375 380 Lys Leu Ser Leu Asp Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu 385 390 395 400 Ile Ala Gly Leu Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln 405 410 415 Trp Asn Arg Ile Phe Arg Trp Lys Cys His Asp Ile Val Lys Ser Lys 420 425 430 Val Phe Tyr Trp Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser 435 440 445 Ile Ala Ser Glu His His Asn Gln Pro His Trp Leu Thr Arg Leu Gln 450 455 460 Asp Ile Ala Asn Arg Val Leu Leu Ser Leu Phe Thr Thr Glu Met Leu 465 470 475 480 Met Lys Met Tyr Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe 485 490 495 Asn Arg Phe Asp Cys Phe Val Val Cys Ser Gly Ile Leu Glu Ile Leu 500 505 510 Leu Val Glu Ser Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg 515 520 525 Cys Ile Arg Leu Leu Arg Ile Phe Lys Ile Thr Lys Tyr Trp Thr Ser 530 535 540 Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala 545 550 555 560 Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Val Ile Phe Arg Leu Leu 565 570 575 Gly Met Gln Leu Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val 580 585 590 Arg Arg Ser Asn Phe Asp Asn Phe Pro Gln Ala 
Leu Ile Ser Val Phe 595 600 605 Gln Val Leu Thr Gly Glu Asp Trp Thr Ser Met Met Tyr Asn Gly Ile 610 615 620 Met Ala Ser Ser Gly Pro Ser Tyr Pro Gly Met Leu Val Cys Ile Tyr 625 630 635 640 Phe Ile Ile Leu Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe 645 650 655 Leu Ala Ile Ala Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser 660 665 670 Ala Gln Lys Ala Lys Ala Glu Glu Lys Lys Arg Arg Lys Met Ser Lys 675 680 685 Gly Leu Pro Asp Lys Ser Glu Glu Glu Lys Ser Thr Met Ala Lys Lys 690 695 700 Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu 705 710 715 720 Lys Ile Asp Glu Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr 725 730 735 Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile 740 745 750 Pro Leu Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu 755 760 765 Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro 770 775 780 Thr Asn Lys Ile Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp 785 790 795 800 Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu 805 810 815 Ala Ala Glu Asp Pro Ile Arg Ala Asp Ser Met Arg Asn Gln Ile Leu 820 825 830 Lys His Phe Asp Ile Gly Phe Thr Ser Val Phe Thr Val Glu Ile Val 835 840 845 Leu Lys Met Thr Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys 850 855 860 Arg Asn Tyr Phe Asn Met Leu Asp Leu Leu Val Val Ala Val Ser Leu 865 870 875 880 Ile Ser Met Gly Leu Glu Ser Ser Ala Ile Ser Val Val Lys Ile Leu 885 890 895 Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys 900 905 910 Gly Leu Lys His Val Ala Arg Cys Met Phe Val Ala Ile Ser Thr Ile 915 920 925 Gly Asn Ile Val Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys 930 935 940 Ile Gly Val Gln Leu Phe Lys Gly Lys Phe Phe Arg Cys Thr Asp Leu 945 950 955 960 Ser Lys Met Thr Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys 965 970 975 Asp Gly Asp Pro Met Gln Ile Glu Leu Arg His Arg Glu Trp Val His 980 985 990 Ser Asp Phe His Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe 995 1000 1005 Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu 
Leu Tyr Lys Ala 1010 1015 1020 Ile Asp Ser Asn Ala Glu Asp Val Gly Pro Ile Tyr Asn Asn Arg 1025 1030 1035 Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala 1040 1045 1050 Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe 1055 1060 1065 Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 1070 1075 1080 Asn Gln Arg Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu 1085 1090 1095 Arg Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr 1100 1105 1110 Ile Val Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile 1115 1120 1125 Met Leu Asn Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser 1130 1135 1140 Glu Gln Met Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr 1145 1150 1155 Ile Ile Phe Thr Leu Glu Met Ile Leu Lys Leu Met Ala Phe Lys 1160 1165 1170 Ala Arg Gly Tyr Phe Gly Asn Pro Trp Asn Val Phe Asp Phe Leu 1175 1180 1185 Ile Val Ile Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp 1190 1195 1200 Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala Phe Phe Arg Leu 1205 1210 1215 Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser Arg Ala Glu Gly 1220 1225 1230 Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala Leu 1235 1240 1245 Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe Phe Ile Tyr Ala 1250 1255 1260 Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala Leu Val Asp Gly 1265 1270 1275 Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln Ala 1280 1285 1290 Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp Gln Glu 1295 1300 1305 Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp Pro Glu Ser 1310 1315 1320 Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn Phe Ala 1325 1330 1335 Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu Val 1340 1345 1350 Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr Leu 1355 1360 1365 Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu Phe 1370 1375 1380 Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg Ile 1385 1390 1395 Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro Pro 1400 1405 1410 Leu Gly Phe Gly 
Lys Phe Cys Pro His Arg Val Ala Cys Lys Arg 1415 1420 1425 Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val Thr 1430 1435 1440 Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys Ile 1445 1450 1455 Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu Glu Leu Arg Ala 1460 1465 1470 Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met Lys Leu Leu Asp 1475 1480 1485 Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val Thr Val Gly Lys 1490 1495 1500 Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe Arg Lys Phe Met 1505 1510 1515 Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro Lys Lys Asp Ile 1520 1525 1530 Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu Glu Glu Ala Ala 1535 1540 1545 Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu Ala Ala Glu Glu 1550 1555 1560 Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu Gly Ile 1565 1570 1575 Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Asn Phe Leu 1580 1585 1590 Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg Pro 1595 1600 1605 Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met Glu Ser Pro Val 1610 1615 1620 Phe Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr Asn Pro Leu Ala 1625 1630 1635 Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Ala Asn 1640 1645 1650 Ser Asn His Ser Asn Ser His Val Phe Ser Ser Val His Tyr Glu 1655 1660 1665 Arg Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala Thr Arg Gly Arg 1670 1675 1680 Ala Leu Gly Gln Pro Cys Arg Ser Leu Gly Pro His Ser Lys Pro 1685 1690 1695 Cys Val Glu Met Leu Lys Gly Leu Leu Thr Gln Arg Ala Met Pro 1700 1705 1710 Arg Gly Gln Ala Pro Pro Ala Pro Cys Gln Cys Pro Arg Val Glu 1715 1720 1725 Ser Ser Met Pro Glu Asp Arg Lys Ser Ser Thr Pro Gly Ser Leu 1730 1735 1740 His Glu Glu Thr Pro His Ser Arg Ser Thr Arg Glu Asn Thr Ser 1745 1750 1755 Arg Cys Ser Ala Pro Ala Thr Ala Leu Leu Ile Gln Lys Ala Leu 1760 1765 1770 Val Arg Gly Gly Leu Gly Thr Leu Ala Ala Asp Ala Asn Phe Ile 1775 1780 1785 Met Ala Thr Gly Gln Ala Leu Gly Asp Ala Cys Gln Met Glu Pro 1790 1795 1800 Glu Glu Val Glu Ile Met Ala Thr Glu Leu Leu Lys Gly Arg Glu 1805 
1810 1815 Ala Pro Asp Gly Met Ala Ser Ser Leu Gly Cys Leu Asn Leu Gly 1820 1825 1830 Ser Ser Leu Gly Ser Leu Asp Gln His Gln Gly Ser Gln Glu Thr 1835 1840 1845 Leu Ile Pro Pro Arg Leu 1850

3 6501 DNA Homo sapiens CDS (1)..(6501) 3
atg atg atg atg atg atg atg aaa aaa atg cag cat caa cgg cag cag 48 Met Met Met Met Met Met Met Lys Lys Met Gln His Gln Arg Gln Gln 1 5 10 15 caa gcg gac cac gcg aac gag gca aac tat gca aga ggc acc aga ctt 96 Gln Ala Asp His Ala Asn Glu Ala Asn Tyr Ala Arg Gly Thr Arg Leu 20 25 30 cct ctt tct ggt gaa gga cca act tct cag ccg aat agc tcc aag caa 144 Pro Leu Ser Gly Glu Gly Pro Thr Ser Gln Pro Asn Ser Ser Lys Gln 35 40 45 act gtc ctg tct tgg caa gct gca atc gat gct gct aga cag gcc aag 192 Thr Val Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys 50 55 60 gct gcc caa act atg agc acc tct gca ccc cca cct gta gga tct ctc 240 Ala Ala Gln Thr Met Ser Thr Ser Ala Pro Pro Pro Val Gly Ser Leu 65 70 75 80 tcc caa aga aaa cgt cag caa tac gcc aag agc aaa aaa cag ggt aac 288 Ser Gln Arg Lys Arg Gln Gln Tyr Ala Lys Ser Lys Lys Gln Gly Asn 85 90 95 tcg tcc aac agc cga cct gcc cgc gcc ctt ttc tgt tta tca ctc aat 336 Ser Ser Asn Ser Arg Pro Ala Arg Ala Leu Phe Cys Leu Ser Leu Asn 100 105 110 aac ccc atc cga aga gcc tgc att agt ata gtg gaa tgg aaa cca ttt 384 Asn Pro Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe 115 120 125 gac ata ttt ata tta ttg gct att ttt gcc aat tgt gtg gcc tta gct 432 Asp Ile Phe Ile Leu Leu Ala Ile Phe Ala Asn Cys Val Ala Leu Ala 130 135 140 att tac atc cca ttc cct gaa gat gat tct aat tca aca aat cat aac 480 Ile Tyr Ile Pro Phe Pro Glu Asp Asp Ser Asn Ser Thr Asn His Asn 145 150 155 160 ttg gaa aaa gta gaa tat gcc ttc ctg att att ttt aca gtc gag aca 528 Leu Glu Lys Val Glu Tyr Ala Phe Leu Ile Ile Phe Thr Val Glu Thr 165 170 175 ttt ttg aag att ata gcg tat gga tta ttg cta cat cct aat gct tat 576 Phe Leu Lys Ile Ile Ala Tyr Gly Leu Leu Leu His Pro Asn Ala Tyr 180 185 190 gtt agg aat gga tgg aat tta ctg gat ttt gtt ata gta 
ata gta gga 624 Val Arg Asn Gly Trp Asn Leu Leu Asp Phe Val Ile Val Ile Val Gly 195 200 205 ttg ttt agt gta att ttg gaa caa tta acc aaa gaa aca gaa ggc ggg 672 Leu Phe Ser Val Ile Leu Glu Gln Leu Thr Lys Glu Thr Glu Gly Gly 210 215 220 aac cac tca agc ggc aaa tct gga ggc ttt gat gtc aaa gcc ctc cgt 720 Asn His Ser Ser Gly Lys Ser Gly Gly Phe Asp Val Lys Ala Leu Arg 225 230 235 240 gcc ttt cga gtg ttg cga cca ctt cga cta gtg tca ggg gtg ccc agt 768 Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser 245 250 255 tta caa gtt gtc ctg aac tcc att ata aaa gcc atg gtt ccc ctc ctt 816 Leu Gln Val Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu 260 265 270 cac ata gcc ctt ttg gta tta ttt gta atc ata atc tat gct att ata 864 His Ile Ala Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile 275 280 285 gga ttg gaa ctt ttt att gga aaa atg cac aaa aca tgt ttt ttt gct 912 Gly Leu Glu Leu Phe Ile Gly Lys Met His Lys Thr Cys Phe Phe Ala 290 295 300 gac tca gat atc gta gct gaa gag gac cca gct cca tgt gcg ttc tca 960 Asp Ser Asp Ile Val Ala Glu Glu Asp Pro Ala Pro Cys Ala Phe Ser 305 310 315 320 ggg aat gga cgc cag tgt act gcc aat ggc acg gaa tgt agg agt ggc 1008 Gly Asn Gly Arg Gln Cys Thr Ala Asn Gly Thr Glu Cys Arg Ser Gly 325 330 335 tgg gtt ggc ccg aac gga ggc atc acc aac ttt gat aac ttt gcc ttt 1056 Trp Val Gly Pro Asn Gly Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe 340 345 350 gcc atg ctt act gtg ttt cag tgc atc acc atg gag ggc tgg aca gac 1104 Ala Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp 355 360 365 gtg ctc tac tgg gta aat gat gcg ata gga tgg gaa tgg cca tgg gtg 1152 Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val 370 375 380 tat ttt gtt agt ctg atc atc ctt ggc tca ttt ttc gtc ctt aac ctg 1200 Tyr Phe Val Ser Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu 385 390 395 400 gtt ctt ggt gtc ctt agt gga gaa ttc tca aag gaa aga gag aag gca 1248 Val Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala 405 410 415 aaa gca cgg gga 
gat ttc cag aag ctc cgg gag aag cag cag ctg gag 1296 Lys Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu 420 425 430 gag gat cta aag ggc tac ttg gat tgg atc acc caa gct gag gac atc 1344 Glu Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile 435 440 445 gat ccg gag aat gag gaa gaa gga gga gag gaa ggc aaa cga aat act 1392 Asp Pro Glu Asn Glu Glu Glu Gly Gly Glu Glu Gly Lys Arg Asn Thr 450 455 460 agc atg ccc acc agc gag act gag tct gtg aac aca gag aac gtc agc 1440 Ser Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser 465 470 475 480 ggt gaa ggc gag aac cga ggc tgc tgt gga agt ctc tgg tgc tgg tgg 1488 Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp 485 490 495 aga cgg aga ggc gcg gcc aag gcg ggg ccc tct ggg tgt cgg cgg tgg 1536 Arg Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp 500 505 510 ggt caa gcc atc tca aaa tcc aaa ctc agc cga cgc tgg cgt cgc tgg 1584 Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp 515 520 525 aac cga ttc aat cgc aga aga tgt agg gcc gcc gtg aag tct gtc acg 1632 Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr 530 535 540 ttt tac tgg ctg gtt atc gtc ctg gtg ttt ctg aac acc tta acc att 1680 Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile 545 550 555 560 tcc tct gag cac tac aat cag cca gat tgg ttg aca cag att caa gat 1728 Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln Ile Gln Asp 565 570 575 att gcc aac aaa gtc ctc ttg gct ctg ttc acc tgc gag atg ctg gta 1776 Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val 580 585 590 aaa atg tac agc ttg ggc ctc caa gca tat ttc gtc tct ctt ttc aac 1824 Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn 595 600 605 cgg ttt gat tgc ttc gtg gtg tgt ggt gga atc act gag acg atc ctg 1872 Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu 610 615 620 gtg gaa ctg gaa atc atg tct ccc ctg ggg atc tct gtg ttt cgg tgt 1920 Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val 
Phe Arg Cys 625 630 635 640 gtg cgc ctc tta aga atc ttc aaa gtg acc agg cac tgg act tcc ctg 1968 Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu 645 650 655 agc aac tta gtg gca tcc tta tta aac tcc atg aag tcc atc gct tcg 2016 Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser 660 665 670 ctg ttg ctt ctg ctt ttt ctc ttc att atc atc ttt tcc ttg ctt ggg 2064 Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly 675 680 685 atg cag ctg ttt ggc ggc aag ttt aat ttt gat gaa acg caa acc aag 2112 Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys 690 695 700 cgg agc acc ttt gac aat ttc cct caa gca ctt ctc aca gtg ttc cag 2160 Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln 705 710 715 720 atc ctg aca ggc gaa gac tgg aat gct gtg atg tac gat ggc atc atg 2208 Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met 725 730 735 gct tac ggg ggc cca tcc tct tca gga atg atc gtc tgc atc tac ttc 2256 Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe 740 745 750 atc atc ctc ttc att tgt ggt aac tat att cta ctg aat gtc ttc ttg 2304 Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu 755 760 765 gcc atc gct gta gac aat ttg gct gat gct gaa agt ctg aac act gct 2352 Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala 770 775 780 cag aaa gaa gaa gcg gaa gaa aag gag agg aaa aag att gcc aga aaa 2400 Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys 785 790 795 800 gag agc cta gaa aat aaa aag aac aac aaa cca gaa gtc aac cag ata 2448 Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile 805 810 815 gcc aac agt gac aac aag gtt aca att gat gac tat aga gaa gag gat 2496 Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg Glu Glu Asp 820 825 830 gaa gac aag gac ccc tat ccg cct tgc gat gtg cca gta ggg gaa gag 2544 Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Val Gly Glu Glu 835 840 845 gaa gag gaa gag gag gag gat gaa cct gag gtt cct gcc gga ccc cgt 2592 Glu Glu 
Glu Glu Glu Glu Asp Glu Pro Glu Val Pro Ala Gly Pro Arg 850 855 860 cct cga agg atc tcg gag ttg aac atg aag gaa aaa att gcc ccc atc 2640 Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys Ile Ala Pro Ile 865 870 875 880 cct gaa ggg agc gct ttc ttc att ctt agc aag acc aac ccg atc cgc 2688 Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr Asn Pro Ile Arg 885 890 895 gta ggc tgc cac aag ctc atc aac cac cac atc ttc acc aac ctc atc 2736 Val Gly Cys His Lys Leu Ile Asn His His Ile Phe Thr Asn Leu Ile 900 905 910 ctt gtc ttc atc atg ctg agc agc gct gcc ctg gcc gca gag gac ccc 2784 Leu Val Phe Ile Met Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro 915 920 925 atc cgc agc cac tcc ttc cgg aac acg ata ctg ggt tac ttt gac tat 2832 Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr 930 935 940 gcc ttc aca gcc atc ttt act gtt gag atc ctg ttg aag atg aca act 2880 Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr 945 950 955 960 ttt gga gct ttc ctc cac aaa ggg gcc ttc tgc agg aac tac ttc aat 2928 Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn 965 970 975 ttg ctg gat atg ctg gtg gtt ggg gtg tct ctg gtg tca ttt ggg att 2976 Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val Ser Phe Gly Ile 980 985 990 caa tcc agt gcc atc tcc gtt gtg aag att ctg agg gtc tta agg gtc 3024 Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 995 1000 1005 ctg cgt ccc ctc agg gcc atc aac aga gca aaa gga ctt aag cac 3069 Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 1010 1015 1020 gtg gtc cag tgc gtc ttc gtg gcc atc cgg acc atc ggc aac atc 3114 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile 1025 1030 1035 atg atc gtc act acc ctc ctg cag ttc atg ttt gcc tgt atc ggg 3159 Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 1040 1045 1050 gtc cag ttg ttc aag ggg aag ttc tat cgc tgt acg gat gaa gcc 3204 Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 1055 1060 1065 aaa agt aac cct gaa gaa tgc agg gga ctt ttc atc ctc tac 
aag 3249 Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys 1070 1075 1080 gat ggg gat gtt gac agt cct gtg gtc cgt gaa cgg atc tgg caa 3294 Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln 1085 1090 1095 aac agt gat ttc aac ttc gac aac gtc ctc tct gct atg atg gcg 3339 Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 1100 1105 1110 ctc ttc aca gtc tcc acg ttt gag ggc tgg cct gcg ttg ctg tat 3384 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr 1115 1120 1125 aaa gcc atc gac tcg aat gga gag aac atc ggc cca atc tac aac 3429 Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn 1130 1135 1140 cac cgc gtg gag atc tcc atc ttc ttc atc atc tac atc atc att 3474 His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile 1145 1150 1155 gta gct ttc ttc atg atg aac atc ttt gtg ggc ttt gtc atc gtt 3519 Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 1160 1165 1170 aca ttt cag gaa caa gga gaa aaa gag tat aag aac tgt gag ctg 3564 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu 1175 1180 1185 gac aaa aat cag cgt cag tgt gtt gaa tac gcc ttg aaa gca cgt 3609 Asp Lys Asn Gln Arg Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg 1190 1195 1200 ccc ttg cgg aga tac atc ccc aaa aac ccc tac cag tac aag ttc 3654 Pro Leu Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe 1205 1210 1215 tgg tac gtg gtg aac tct tcg cct ttc gaa tac atg atg ttt gtc 3699 Trp Tyr Val Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val 1220 1225 1230 ctc atc atg ctc aac aca ctc tgc ttg gcc atg cag cac tac gag 3744 Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu 1235 1240 1245 cag tcc aag atg ttc aat gat gcc atg gac att ctg aac atg gtc 3789 Gln Ser Lys Met Phe Asn Asp Ala Met Asp Ile Leu Asn Met Val 1250 1255 1260 ttc acc ggg gtg ttc acc gtc gag atg gtt ttg aaa gtc atc gca 3834 Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val Ile Ala 1265 1270 1275 ttt aag cct aag ggg tat ttt agt gac gcc tgg aac acg ttt gac 3879 Phe Lys Pro Lys Gly Tyr Phe 
Ser Asp Ala Trp Asn Thr Phe Asp 1280 1285 1290 tcc ctc atc gta atc ggc agc att ata gac gtg gcc ctc agc gaa 3924 Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu 1295 1300 1305 gcg gac aac tct gaa gag agc aat aga atc tcc atc acc ttt ttc 3969 Ala Asp Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe 1310 1315 1320 cgt ctt ttc cga gtg atg cga ttg gtg aag ctt ctc agc agg ggg 4014 Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 1325 1330 1335 gaa ggc atc cgg aca ttg ctg tgg act ttt att aag tcc ttt cag 4059 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln 1340 1345 1350 gcg ctc ccg tat gtg gcc ctc ctc ata gcc atg ctg ttc ttc atc 4104 Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile 1355 1360 1365 tat gcg gtc att ggc atg cag atg ttt ggg aaa gtt gcc atg aga 4149 Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met Arg 1370 1375 1380 gat aac aac cag atc aat agg aac aat aac ttc cag acg ttt ccc 4194 Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 1385 1390 1395 cag gcg gtg ctg ctg ctc ttc agg tgt gca aca ggt gag gcc tgg 4239 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1400 1405 1410 cag gag atc atg ctg gcc tgt ctc cca ggg aag ctc tgt gac cct 4284 Gln Glu Ile Met Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro 1415 1420 1425 gag tca gat tac aac ccc ggg gag gag tat aca tgt ggg agc aac 4329 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1430 1435 1440 ttt gcc att gtc tat ttc atc agt ttt tac atg ctc tgt gca ttt 4374 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1445 1450 1455 ctg atc atc aat ctg ttt gtg gct gtc atc atg gat aat ttc gac 4419 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1460 1465 1470 tat ctg acc cgg gac tgg tct att ttg ggg cct cac cat tta gat 4464 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1475 1480 1485 gaa ttc aaa aga ata tgg tca gaa tat gac cct gag gca aag gga 4509 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1490 
1495 1500 agg ata aaa cac ctt gat gtg gtc act ctg ctt cga cgc atc cag 4554 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1505 1510 1515 cct ccc ctg ggg ttt ggg aag tta tgt cca cac agg gta gcg tgc 4599 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1520 1525 1530 aag aga tta gtt gcc atg aac atg cct ctc aac agt gac ggg aca 4644 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1535 1540 1545 gtc atg ttt aat gca acc ctg ttt gct ttg gtt cga acg gct ctt 4689 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1550 1555 1560 aag atc aag acc gaa ggg aac ctg gag caa gct aat gaa gaa ctt 4734 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1565 1570 1575 cgg gct gtg ata aag aaa att tgg aag aaa acc agc atg aaa tta 4779 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1580 1585 1590 ctt gac caa gtt gtc cct cca gct ggt gat gat gag gta acc gtg 4824 Leu Asp Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1595 1600 1605 ggg aag ttc tat gcc act ttc ctg ata cag gac tac ttt agg aaa 4869 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1610 1615 1620 ttc aag aaa cgg aaa gaa caa gga ctg gtg gga aag tac cct gcg 4914 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 1625 1630 1635 aag aac acc aca att gcc cta cag gcg gga tta agg aca ctg cat 4959 Lys Asn Thr Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1640 1645 1650 gac att ggg cca gaa atc cgg cgt gct ata tcg tgt gat ttg caa 5004 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1655 1660 1665 gat gac gag cct gag gaa aca aaa cga gaa gaa gaa gat gat gtg 5049 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1670 1675 1680 ttc aaa aga aat ggt gcc ctg ctt gga aac cat gtc aat cat gtt 5094 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1685 1690 1695 aat agt gat agg aga gat tcc ctt cag cag acc aat acc acc cac 5139 Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His 1700 1705 1710 cgt ccc ctg cat gtc caa agg 
cct tca att cca cct gca agt gat 5184 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1715 1720 1725 act gag aaa ccg ctg ttt cct cca gca gga aat tcg gtg tgt cat 5229 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1730 1735 1740 aac cat cat aac cat aat tcc ata gga aag caa gtt ccc acc tca 5274 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1745 1750 1755 aca aat gcc aat ctc aat aat gcc aat atg tcc aaa gct gcc cat 5319 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1760 1765 1770 gga aag cgg ccc agc att ggg aac ctt gag cat gtg tct gaa aat 5364 Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1775 1780 1785 ggg cat cat tct tcc cac aag cat gac cgg gag cct cag aga agg 5409 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1790 1795 1800 tcc agt gtg aaa aga acc cgc tat tat gaa act tac att agg tcc 5454 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1805 1810 1815 gac tca gga gat gaa cag ctc cca act att tgc cgg gaa gac cca 5499 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1820 1825 1830 gag ata cat ggc tat ttc agg gac ccc cac tgc ttg ggg gag cag 5544 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1835 1840 1845 gag tat ttc agt agt gag gaa tgc tac gag gat gac agc tcg ccc 5589 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro 1850 1855 1860 acc tgg agc agg caa aac tat ggc tac tac agc aga tac cca ggc 5634 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1865 1870 1875 aga aac atc gac tct gag agg ccc cga ggc tac cat cat ccc caa 5679 Arg Asn Ile Asp Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1880 1885 1890 gga ttc ttg gag gac gat gac tcg ccc gtt tgc tat gat tca cgg 5724 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1895 1900 1905 aga tct cca agg aga cgc cta cta cct ccc acc cca gca tcc cac 5769 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1910 1915 1920 cgg aga tcc tcc ttc aac ttt gag tgc ctg cgc cgg cag agc agc 5814 
Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1925 1930 1935 cag gaa gag gtc ccg tcg tct ccc atc ttc ccc cat cgc acg gcc 5859 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1940 1945 1950 ctg cct ctg cat cta atg cag caa cag atc atg gca gtt gcc ggc 5904 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1955 1960 1965 cta gat tca agt aaa gcc cag aag tac tca ccg agt cac tcg acc 5949 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1970 1975 1980 cgg tcg tgg gcc acc cct cca gca acc cct ccc tac cgg gac tgg 5994 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp 1985 1990 1995 aca ccg tgc tac acc ccc ctg atc caa gtg gag cag tca gag gcc 6039 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 2000 2005 2010 ctg gac cag gtg aac ggc agc ctg ccg tcc ctg cac cgc agc tcc 6084 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 2015 2020 2025 tgg tac aca gac gag ccc gac atc tcc tac cgg act ttc aca cca 6129 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 2030 2035 2040 gcc agc ctg act gtc ccc agc agc ttc cgg aac aaa aac agc gac 6174 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 2045 2050 2055 aag cag agg agt gcg gac agc ttg gtg gag gca gtc ctg ata tcc 6219 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 2060 2065 2070 gaa ggc ttg gga cgc tat gca agg gac cca aaa ttt gtg tca gca 6264 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 2075 2080 2085 aca aaa cac gaa atc gct gat gcc tgt gac ctc acc atc gac gag 6309 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu 2090 2095 2100 atg gag agt gca gcc agc acc ctg ctt aat ggg aac gtg cgt ccc 6354 Met Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 2105 2110 2115 cga gcc aac ggg gat gtg ggc ccc ctc tca cac cgg cag gac tat 6399 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 2120 2125 2130 gag cta cag gac ttt ggt cct ggc tac agc gac gaa gag cca gac 6444 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser 
Asp Glu Glu Pro Asp 2135 2140 2145 cct ggg agg gat gag gag gac ctg gcg gat gaa atg ata tgc atc 6489 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 2150 2155 2160 acc acc ttg tag 6501 Thr Thr Leu 2165 4 2166 PRT Homo sapiens 4 Met Met Met Met Met Met Met Lys Lys Met Gln His Gln Arg Gln Gln 1 5 10 15 Gln Ala Asp His Ala Asn Glu Ala Asn Tyr Ala Arg Gly Thr Arg Leu 20 25 30 Pro Leu Ser Gly Glu Gly Pro Thr Ser Gln Pro Asn Ser Ser Lys Gln 35 40 45 Thr Val Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys 50 55 60 Ala Ala Gln Thr Met Ser Thr Ser Ala Pro Pro Pro Val Gly Ser Leu 65 70 75 80 Ser Gln Arg Lys Arg Gln Gln Tyr Ala Lys Ser Lys Lys Gln Gly Asn 85 90 95 Ser Ser Asn Ser Arg Pro Ala Arg Ala Leu Phe Cys Leu Ser Leu Asn 100 105 110 Asn Pro Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe 115 120 125 Asp Ile Phe Ile Leu Leu Ala Ile Phe Ala Asn Cys Val Ala Leu Ala 130 135 140 Ile Tyr Ile Pro Phe Pro Glu Asp Asp Ser Asn Ser Thr Asn His Asn 145 150 155 160 Leu Glu Lys Val Glu Tyr Ala Phe Leu Ile Ile Phe Thr Val Glu Thr 165 170 175 Phe Leu Lys Ile Ile Ala Tyr Gly Leu Leu Leu His Pro Asn Ala Tyr 180 185 190 Val Arg Asn Gly Trp Asn Leu Leu Asp Phe Val Ile Val Ile Val Gly 195 200 205 Leu Phe Ser Val Ile Leu Glu Gln Leu Thr Lys Glu Thr Glu Gly Gly 210 215 220 Asn His Ser Ser Gly Lys Ser Gly Gly Phe Asp Val Lys Ala Leu Arg 225 230 235 240 Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser 245 250 255 Leu Gln Val Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu 260 265 270 His Ile Ala Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile 275 280 285 Gly Leu Glu Leu Phe Ile Gly Lys Met His Lys Thr Cys Phe Phe Ala 290 295 300 Asp Ser Asp Ile Val Ala Glu Glu Asp Pro Ala Pro Cys Ala Phe Ser 305 310 315 320 Gly Asn Gly Arg Gln Cys Thr Ala Asn Gly Thr Glu Cys Arg Ser Gly 325 330 335 Trp Val Gly Pro Asn Gly Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe 340 345 350 Ala Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp 355 360 365 Val Leu Tyr 
Trp Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val 370 375 380 Tyr Phe Val Ser Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu 385 390 395 400 Val Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala 405 410 415 Lys Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu 420 425 430 Glu Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile 435 440 445 Asp Pro Glu Asn Glu Glu Glu Gly Gly Glu Glu Gly Lys Arg Asn Thr 450 455 460 Ser Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser 465 470 475 480 Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp 485 490 495 Arg Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp 500 505 510 Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp 515 520 525 Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr 530 535 540 Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile 545 550 555 560 Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln Ile Gln Asp 565 570 575 Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val 580 585 590 Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn 595 600 605 Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu 610 615 620 Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val Phe Arg Cys 625 630 635 640 Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu 645 650 655 Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser 660 665 670 Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly 675 680 685 Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys 690 695 700 Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln 705 710 715 720 Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met 725 730 735 Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe 740 745 750 Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu 755 760 765 Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala 770 775 780 Gln Lys Glu Glu 
Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys 785 790 795 800 Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile 805 810 815 Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg Glu Glu Asp 820 825 830 Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Val Gly Glu Glu 835 840 845 Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro Ala Gly Pro Arg 850 855 860 Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys Ile Ala Pro Ile 865 870 875 880 Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr Asn Pro Ile Arg 885 890 895 Val Gly Cys His Lys Leu Ile Asn His His Ile Phe Thr Asn Leu Ile 900 905 910 Leu Val Phe Ile Met Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro 915 920 925 Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr 930 935 940 Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr 945 950 955 960 Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn 965 970 975 Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val Ser Phe Gly Ile 980 985 990 Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 995 1000 1005 Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 1010 1015 1020 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile 1025 1030 1035 Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 1040 1045 1050 Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 1055 1060 1065 Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys 1070 1075 1080 Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln 1085 1090 1095 Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 1100 1105 1110 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr 1115 1120 1125 Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn 1130 1135 1140 His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile 1145 1150 1155 Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 1160 1165 1170 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu 1175 1180 1185 Asp Lys Asn Gln Arg Gln Cys Val Glu 
Tyr Ala Leu Lys Ala Arg 1190 1195 1200 Pro Leu Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe 1205 1210 1215 Trp Tyr Val Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val 1220 1225 1230 Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu 1235 1240 1245 Gln Ser Lys Met Phe Asn Asp Ala Met Asp Ile Leu Asn Met Val 1250 1255 1260 Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val Ile Ala 1265 1270 1275 Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe Asp 1280 1285 1290 Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu 1295 1300 1305 Ala Asp Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe 1310 1315 1320 Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 1325 1330 1335 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln 1340 1345 1350 Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile 1355 1360 1365 Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met Arg 1370 1375 1380 Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 1385 1390 1395 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1400 1405 1410 Gln Glu Ile Met Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro 1415 1420 1425 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1430 1435 1440 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1445 1450 1455 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1460 1465 1470 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1475 1480 1485 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1490 1495 1500 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1505 1510 1515 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1520 1525 1530 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1535 1540 1545 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1550 1555 1560 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1565 1570 1575 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1580 1585 1590 Leu Asp 
Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1595 1600 1605 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1610 1615 1620 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 1625 1630 1635 Lys Asn Thr Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1640 1645 1650 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1655 1660 1665 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1670 1675 1680 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1685 1690 1695 Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His 1700 1705 1710 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1715 1720 1725 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1730 1735 1740 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1745 1750 1755 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1760 1765 1770 Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1775 1780 1785 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1790 1795 1800 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1805 1810 1815 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1820 1825 1830 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1835 1840 1845 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro 1850 1855 1860 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1865 1870 1875 Arg Asn Ile Asp Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1880 1885 1890 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1895 1900 1905 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1910 1915 1920 Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1925 1930 1935 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1940 1945 1950 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1955 1960 1965 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1970 1975 1980 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp 
Trp 1985 1990 1995 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 2000 2005 2010 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 2015 2020 2025 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 2030 2035 2040 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 2045 2050 2055 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 2060 2065 2070 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 2075 2080 2085 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu 2090 2095 2100 Met Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 2105 2110 2115 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 2120 2125 2130 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp 2135 2140 2145 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 2150 2155 2160 Thr Thr Leu 2165 5 557 DNA Homo sapiens partial_cDNA (1)..(557) 5 ctgattgtca tcggcagcat cattgacgtc atcctcagtg agatcgacga ccccgacgag 60 agtgcccgca tctccagcgc cttcttccgc ctgttccggg tcatgaggtt gatcaagctg 120 ctgagccgcg ccgagggcgt gcgcacgctg ctccggacct tcatcaagtc cttccaggcc 180 ctgccctacg tggctctgct catcgtcatg ctcttcttca tctatgctgt catcggcatg 240 cagatgtttg ggaagatcgc catggtggac ggaacccaga taaaccggaa caacaatttc 300 cagaccttcc ctcaggctgt gctgctgctc ttcaggtgcg ccacgggtga ggcgtggcag 360 gagatcctgc tggcctgcag gtacgggcag ctgtgcgacc ccgagtcaga ctacctccct 420 ggggaggagt atacctgcgg caccgacttc gcctactact acttcatcag cttctacatg 480 ctctgtgcct tcctgatcat caacctcttt gtggctgtga tcatggacaa ttttgactac 540 ctcacccgga ctggtcc 557 6 553 DNA Homo sapiens partial_cDNA (1)..(553) n is dATP, dCTP, dGTP, or dTTP 6 ctcatcgtaa tcggcagcat tatagacgtg gccctcagcg aagcagacaa ctctgaagag 60 agcaatagaa tctccatcac ctttttccgt cttttccgag tgatgcgatt ggtgaagctt 120 ctcagcaggg gggaaggcat ccggacattg ctgtggactt ttattaagtc ctttcaggcg 180 ctcccgtatg tggccctcct catagccatg ctgttcttca tctatgcggt cattggcatg 240 cagatgtttg ggaaagttgc catgagagat aacaaccaga tcaataggaa caataacttc 300 
cagacgtttc cccaggcggt gctgctgctc ttcaggtgtg caacaggtga ggcctggcag 360 gagatcatgc tggcctgtct cccagggaag ctctgtgacc ctgagtcaga ttacaacctc 420 ggggaggagt atacatgtgg gagcaacttt gccattgtct atttcatcag nttttacatg 480 ctctgtgcat ttctgacatc aatctgtttg gtggctgtca tcatggataa tttcgactat 540 ctgaccccng gac 553 7 498 DNA pig partial_cDNA (1)..(498) 7 tggaatgtgt tcgacttcct gattgtcatc ggcagcatca ttgacgtcat cctcagtgag 60 atcgacgacc ccgacgagag tgcccgcatc tccagcgcct tcttccgcct gttccgggtc 120 atgaggttga tcaagctgct gagccgcgcc gagggcgtgc gcacgctgct ctggaccttc 180 atcaagtcct tccaggccct gccctacgtg gctctgctca tcgtcatgct cttcttcatc 240 tatgctgtca tcggcatgca gatgtttggg aagatcgcca tggtggacgg aacccagata 300 aaccggaaca acaatttcca gaccttccct caggctgtgc tgctgctctt caggtgcgcc 360 acgggtgagg cgtggcagga gatcctgctg gcctgcaggt acgggcagct gtgcgacccc 420 gagtcagact acctccctgg ggaggagtat acctgcggca ccgacttcgc ctactactac 480 ttcatcagct tctacatg 498 8 573 DNA pig partial_cDNA (1)..(573) n is dATP, dCTP, dGTP, or dTTP Xaa is any amino acid 8 tggcgggatg acctggtcca ggagcttcat gctggttctc ttccagatct tcttgataat 60 ggccctcagc tcctcgttgg cctgctcaaa gttaccttcc gtcttgatct tgagtgccgt 120 tcggaccagg gcgaagaggg tggcattgaa ggtgaccgag ccatcgctat tcaggggcat 180 gttcatgccc accagccgct tacatgccac ccggtgtgga caaaatttcc caaagcccag 240 agggggctgg atccttctca gcagggtcac cacgtccagg tgcttgattc tgcccttagc 300 ttctgggtcg tactctgccc agatggcctt gaactcgtcc aggtgatgtg ggcccaggat 360 ggaccagtcc cgggtgaggt agtcaaaatt gtccatgatc acagccacaa agaggttgat 420 gatcaggaag gcacagagca tgtagaagct gatgaagtag tagtaggcga agtcggtgcc 480 cgcaggtata ctcctcccca gggaggtagt ctgactcggg gtcgcacagc ttgccgtacc 540 tgcaggccan caggatctcc tgcacgcctt acc 573 9 6615 DNA Homo sapiens gene (1)..(6615) n is dATP, dCTP, dGTP, or dTTP 9 caggcccggc agcggggagc cgagtggagg ctaattttac ttgctgggag cgaggagagt 60 aatcctcctg cccccactcc tgcccccgcc ccctggctgg ctcagcaggg caggctcagc 120 cgacagcctc agccagccta gtccccaagg cgggggcatt ggggacacag ggaagggaaa 180 gcactggggt gggggagcag 
gagaaagcca gattcccagg gaagccatgg agccatcctc 240 accccaggat gaaggcctga ggaagaaaca gcccaagaag ccagttcctg agattctgcc 300 aaggccaccc cgggccttgt tctgcctgac cctggagaac cccctgagga aggcctgcat 360 cagcattgta gaatggaagc ccttcgagac gatcatcttg ctcaccatct ttgccaattg 420 tgtggccctg gccgtgtacc tgcccatgcc ggaagatgac aacaactctc tgaacctcgg 480 cctggagaag ctggagtatt tcttcctcat tgtcttctcg attgaagccg ccatgaagat 540 cattgcctac ggcttcttat tccaccagga cgcttacctg cgcagtggct ggaatgtgct 600 ggacttcacc attgtcttcc tgggggtctt caccgtgatt ctggacaagg ttaacgtcat 660 ccaaagccac acagccccat gagcagcaaa ggagccggct tggatgtcaa ggccctcaga 720 gccttccgag tgctcagacc cctccggctg gtgtcggggg tgcctagcct gcaggtggtc 780 ctgaactcca tcttcaaggc catgctcccc ctctttcaca tcgccctgct ggtcctcttt 840 atggtcatca tctatgccat catcgggctg gagctcttca agggcaagat gcacaagacc 900 tgctacttca ttggtacaga tatcgtggcc acggtggaga atgaagagcc atcgccctgc 960 gccaggacgg gctcagggcg ccggtgcacc atcaatggca gtgagtgccg gggcggctgg 1020 ccagggccca accatggcat cacccacttc gacaacttcg gcttctccat gctcaccgtg 1080 taccagtgca ttaccatgga gggatggact gacgtccttt actgggtcaa tgatgccatc 1140 gggaatgagt ggccctggat ctattttgtc accctcattt tgctgggatc cttcttcatc 1200 ctcaacctgg tgctgggtgt cctgagtggg gaattcacca aggagcggga gaaggccaag 1260 tccaggggaa ccttccagaa gctccgggag aagcagcaac tagatgagga ccttcggggc 1320 tacatgagct ggatcacgca gggcgaggtc atggatgttg aggacttcag agaaggaaaa 1380 ctgtctttgg atgaaggtgg ctctgacaca gagagcctgt atgaaattgc aggcttgaac 1440 aaaatcatcc agttcatccg acattggagg cagtggaacc gcatctttcg ctggaagtgc 1500 catgacatcg tgaagtccaa ggtcttctat tggctggtga ttctcatcgt tgccctcaac 1560 accctgtcta tcgcctcaga gcaccacaac cagcctctct ggctgacccg tttgcaagac 1620 attgccaacc gggtgctgct gtccctcttc accactgaga tgctgatgaa gatgtacggg 1680 ctgggcctgc gccagtactt catgtctatc ttcaaccgct tcgactgctt cgtggtgtgc 1740 agcggtatcc tggagatcct gctggtggag tcgggcgcca tgacacccct gggcatctcc 1800 gtgctccgct gcatccgcct cctgaggatc ttcaagatca ccaaatattg gacgtcgctg 1860 agcaacctgg tggcatccct gctcaactcc atccgctcca 
tcgcctccct gctgctgctg 1920 ctcttcctct tcatcgtcat cttcgccctc ctgggcatgc agctctttgg ggggaggtat 1980 gactttgaag acacagaagt acggcgcagc aactttgaca actttcccca agccctcatc 2040 agcgtcttcc aggtactgac aggggaagac tggacctcaa tgatgtacaa tgggatcatg 2100 gcctacggcg ggccgtccta ccctggcatg cttgtgtgca tttacttcat catccttttc 2160 gtctgtggca actacatcct gctcaatgtc ttcctggcca ttgccgtgga caacctggcc 2220 gaggcggaga gcctgacttc tgcccagaac ggccaaggct gaggagaaaa aacgcaggaa 2280 g atg tcc aag ggt ctc cca gac aag tca gaa gag gag aag tca acg atg 2329 Met Ser Lys Gly Leu Pro Asp Lys Ser Glu Glu Glu Lys Ser Thr Met 1 5 10 15 gcc aag aag ctg gag cag aaa ccc aag ggt gag ggc atc ccc acc act 2377 Ala Lys Lys Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr 20 25 30 gcc aag ctg ana atc gat gag ttn gaa tct aat gtc aat gag gtg aag 2425 Ala Lys Leu Xaa Ile Asp Glu Xaa Glu Ser Asn Val Asn Glu Val Lys 35 40 45 gat ccc tac ccc tca gcc gac ttc cca ggg gat gac gag gaa gat gag 2473 Asp Pro Tyr Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp Glu 50 55 60 cct gag atc ccg ctg agc ccc cga cca cgt ccc ctg gct gag ctg cag 2521 Pro Glu Ile Pro Leu Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln 65 70 75 80 ctg aaa gag aag gcc gtg ccc att cca gaa gcc agc tcc ttc ttc atc 2569 Leu Lys Glu Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile 85 90 95 ttc agc ccc acc aat aag atc cgt gtc ctg tgt cac cgc atc gtc aat 2617 Phe Ser Pro Thr Asn Lys Ile Arg Val Leu Cys His Arg Ile Val Asn 100 105 110 gcc acc tgg ttc acc aac ttc atc ctg ctc ttc atc ctg ctc agc agc 2665 Ala Thr Trp Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser 115 120 125 gct gca ctg gct gcg gaa gac ccc atc cgg gct gat tcc atg aga aat 2713 Ala Ala Leu Ala Ala Glu Asp Pro Ile Arg Ala Asp Ser Met Arg Asn 130 135 140 cag atc ctt aaa cac ttt gac atc ggg ttc acc tct gtc ttc act gtg 2761 Gln Ile Leu Lys His Phe Asp Ile Gly Phe Thr Ser Val Phe Thr Val 145 150 155 160 gag att gtc ctc aag atg acg acc tac gga gcc ttc ctg cac aag ggt 2809 Glu Ile Val Leu Lys Met Thr Thr Tyr 
Gly Ala Phe Leu His Lys Gly 165 170 175 tcc ttc tgc cgc aat tac ttc aac atg ctg gac ctg ctg gtg gtg gcc 2857 Ser Phe Cys Arg Asn Tyr Phe Asn Met Leu Asp Leu Leu Val Val Ala 180 185 190 gtg tcc ctc atc tcc atg gga ctt gag tcc agt gcc atc tcc gtg gtg 2905 Val Ser Leu Ile Ser Met Gly Leu Glu Ser Ser Ala Ile Ser Val Val 195 200 205 aag atc ctg agg gtg ctg agg gtg ctc cga cca ctc aga gcc atc aac 2953 Lys Ile Leu Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn 210 215 220 aga gcc aag ggg ttg aag gtg agn nnn nnn nnn nnn nnn nnn nnn nnn 3001 Arg Ala Lys Gly Leu Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 225 230 235 240 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 3049 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 245 250 255 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 3097 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 260 265 270 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 3145 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 275 280 285 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 3193 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 290 295 300 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 3241 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 305 310 315 320 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn tgc cac agg ggc tac 3289 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys His Arg Gly Tyr 325 330 335 tac tac gtg tac aag gac ggg gac ccc atg cag ata gag ctg cgt cac 3337 Tyr Tyr Val Tyr Lys Asp Gly Asp Pro Met Gln Ile Glu Leu Arg His 340 345 350 cgc gag tgg gta cac agc gac ttc cac ttc gac aat gtg ctc tca gcc 3385 Arg Glu Trp Val His Ser Asp Phe His Phe Asp Asn Val Leu Ser Ala 355 360 365 atg atg tcc ctc ttc acg gtc tcc acc ttc gag gga tgg cct cag ctg 3433 Met Met Ser Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu 370 375 380 ctg tac aag gcc ata gac tcc aat gcg gag gac gtg ggt ccc atc tac 
3481 Leu Tyr Lys Ala Ile Asp Ser Asn Ala Glu Asp Val Gly Pro Ile Tyr 385 390 395 400 aac aac cgt gtg gag atg gcc atc ttc ttc atc atc tac atc atc ctc 3529 Asn Asn Arg Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu 405 410 415 att gcc ttc ttc atg atg aac atc ttt gtg ggc ttc gtc att gtc acc 3577 Ile Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr 420 425 430 ttc cag gag cag gga gag act gag tac aag aac tgt gag ctg gac aag 3625 Phe Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 435 440 445 aac cag cgc caa tgt gta cag tat gcc ctg aag gcc cgc cca ctg agg 3673 Asn Gln Arg Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu Arg 450 455 460 tgc tac att ccc aaa aac cca tac cag tac cag gtg tgg tac att gtc 3721 Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr Ile Val 465 470 475 480 acc tcc tcc tac ttt gaa tac ctg atg ttt gcc ctc atc atg ctc aac 3769 Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile Met Leu Asn 485 490 495 acc atc tgc ctc ggc atg cag cac tac aac cag tcg gag cag atg aac 3817 Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser Glu Gln Met Asn 500 505 510 cac atc tca gac atc ctc aat gtg gcc ttc act atc atc ttc acc ctg 3865 His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr Ile Ile Phe Thr Leu 515 520 525 gag atg atc ctc aag ctc atg gcc ttc aag gcc agg ggc tac ttt gga 3913 Glu Met Ile Leu Lys Leu Met Ala Phe Lys Ala Arg Gly Tyr Phe Gly 530 535 540 gac ccc tgg aat gtg ttt gac ttc ctg att gtc att ggc agc atc att 3961 Asp Pro Trp Asn Val Phe Asp Phe Leu Ile Val Ile Gly Ser Ile Ile 545 550 555 560 gat gtc atc ctc agt gag atc gac act ttc ctg gcc tcc agc ggg gga 4009 Asp Val Ile Leu Ser Glu Ile Asp Thr Phe Leu Ala Ser Ser Gly Gly 565 570 575 ctg tat tgc ctg ggt gga ggc tgc ggg aac gtt gac cca gat gag agt 4057 Leu Tyr Cys Leu Gly Gly Gly Cys Gly Asn Val Asp Pro Asp Glu Ser 580 585 590 gcc cgc atc tcc agc gcc ttc ttc cgc ctg ttc cgt gtc atg agg ctg 4105 Ala Arg Ile Ser Ser Ala Phe Phe Arg Leu Phe Arg Val Met Arg Leu 595 600 605 atc aag ctg ctg agc 
cgg gca gaa gga gtg cga acc ctc ctg tgg acg 4153 Ile Lys Leu Leu Ser Arg Ala Glu Gly Val Arg Thr Leu Leu Trp Thr 610 615 620 ttc atc aag tcc ttc cag gcc cta ccc tac gtg gct ctg ctc atc gtc 4201 Phe Ile Lys Ser Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu Ile Val 625 630 635 640 atg ctc ttc ttc atc tac gct gtc atc ggc atg cag atg ttt ggg aag 4249 Met Leu Phe Phe Ile Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys 645 650 655 atc gcc ttg gtg gat ggg acc caa ata aac cgg aac aac aac ttc cag 4297 Ile Ala Leu Val Asp Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln 660 665 670 acc ttc cca caa gct gtg cta ctg ctc ttc agg tgt gca aca ggt gag 4345 Thr Phe Pro Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu 675 680 685 gcc tgg cag gag atc cta ctg gcc tgc agc tat ggg aag ctg tgt gac 4393 Ala Trp Gln Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp 690 695 700 cca gag tcg gac tat gcc cca ggg gag gag tac aca tgt ggc acc aac 4441 Pro Glu Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn 705 710 715 720 ttt gca tac tac tac ttc atc agc ttc tac atg ctc tgt gcc ttc ctg 4489 Phe Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu 725 730 735 gtc atc aac ctc ttt gtg gct gtc atc atg gac aat ttt gac tac ctc 4537 Val Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr Leu 740 745 750 acc cgg gac tgg tcc atc ctg ggc cct cat cac ctg gat gag ttc aag 4585 Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu Phe Lys 755 760 765 gcc atc tgg gca gag tat gac cca gag gct aag ggg aga atc aaa cac 4633 Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg Ile Lys His 770 775 780 ctg gac gtg gtg acc ctg ctg aga agg att cag ccc cct ctg ggc ttt 4681 Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro Pro Leu Gly Phe 785 790 795 800 ggg aag ttc tgc cca cat cgg gta gct tgt aag cgg ctg gtg ggc atg 4729 Gly Lys Phe Cys Pro His Arg Val Ala Cys Lys Arg Leu Val Gly Met 805 810 815 aac atg ccc ctg aac agc gac ggc aca gtc acc ttc aat gcc aca ctc 4777 Asn Met Pro Leu Asn Ser Asp Gly Thr Val Thr Phe Asn 
Ala Thr Leu 820 825 830 ttt gcc ctg gtc cgc acg gca ctc aag atc aag acg gaa ggt aac ttt 4825 Phe Ala Leu Val Arg Thr Ala Leu Lys Ile Lys Thr Glu Gly Asn Phe 835 840 845 gag cag gcc aac gag gag ctg agg gcc atc atc aag aag atc tgg aag 4873 Glu Gln Ala Asn Glu Glu Leu Arg Ala Ile Ile Lys Lys Ile Trp Lys 850 855 860 aga acc agc atg aag ctc ttg gac cag gtc atc cct cca ata gga gat 4921 Arg Thr Ser Met Lys Leu Leu Asp Gln Val Ile Pro Pro Ile Gly Asp 865 870 875 880 gat gag gtg aca gtg ggg aag ttc tac gcc aca ttc ctc atc cag gag 4969 Asp Glu Val Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu 885 890 895 cac ttc cgg aag ttc atg aaa cgc caa gag gag tat tat ggc tat cgg 5017 His Phe Arg Lys Phe Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg 900 905 910 ccc aag aag gac att gta cag atc cag gca ggg ctg cgg acc att gag 5065 Pro Lys Lys Asp Ile Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu 915 920 925 gaa gag gca gcc ccc gag atc tgt cgc acg gtc tca gga gac ctg gct 5113 Glu Glu Ala Ala Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu Ala 930 935 940 gct gag gag gag ctg gag aga gcc atg gtg gag gct gcg atg gag gag 5161 Ala Glu Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu 945 950 955 960 gga ata ttc cgg agg act gga ggc ctg ttt ggc cag gtg gac aac ttc 5209 Gly Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Asn Phe 965 970 975 ctg gaa agg acc aac tcc ctg ccc ccc gtc atg gcc aat cag aga ccc 5257 Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg Pro 980 985 990 ctc cag ttt gct gag ata gag atg gaa gag atg gag tca cct gtc ttc 5305 Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met Glu Ser Pro Val Phe 995 1000 1005 ttg gag gac ttc cca caa gat cca cgc acc aac ccc ctg gct cgt 5350 Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr Asn Pro Leu Ala Arg 1010 1015 1020 gcc aat acc aac aat gcc aac gcc aat gtc gcc tat ggc aac agc 5395 Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Gly Asn Ser 1025 1030 1035 aac cat agc aac agc cat gtg ttt tcc agt gtc cac tat gaa agg 5440 Asn His Ser Asn Ser His 
Val Phe Ser Ser Val His Tyr Glu Arg 1040 1045 1050 gag ttc cca gaa gag aca gag acg cct gct acc aga gga cga gcc 5485 Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala Thr Arg Gly Arg Ala 1055 1060 1065 ctt ggc caa ccc tgc agg gtc cnn nnn nnn nnn nnn nnn nnn nnn 5530 Leu Gly Gln Pro Cys Arg Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1070 1075 1080 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5575 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1085 1090 1095 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5620 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1100 1105 1110 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5665 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1115 1120 1125 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5710 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1130 1135 1140 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5755 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1145 1150 1155 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5800 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1160 1165 1170 nnn nnn aca gga ccc cac agc aaa ccc tgt gtg gag atg ctg aag 5845 Xaa Xaa Thr Gly Pro His Ser Lys Pro Cys Val Glu Met Leu Lys 1175 1180 1185 gga ctg ctg acc cag agg gca atg ccc aga ggc cag gca cct cct 5890 Gly Leu Leu Thr Gln Arg Ala Met Pro Arg Gly Gln Ala Pro Pro 1190 1195 1200 gcc ccc tgc cag tgc ccc agg gtg gag tcc tcc atg cct gag gac 5935 Ala Pro Cys Gln Cys Pro Arg Val Glu Ser Ser Met Pro Glu Asp 1205 1210 1215 aga aag agc tcc aca cca ggg tct ctt cat gag gag aca ccc cac 5980 Arg Lys Ser Ser Thr Pro Gly Ser Leu His Glu Glu Thr Pro His 1220 1225 1230 agc agg agc acc agg gag aat act tcc agg tgc tca gca cca gct 6025 Ser Arg Ser Thr Arg Glu Asn Thr Ser Arg Cys Ser Ala Pro Ala 1235 1240 1245 aca gcc ctg ctg atc caa aag gct ctg gtt cga ggg ggc ctg ggc 6070 Thr Ala Leu Leu Ile Gln Lys Ala Leu Val Arg Gly Gly Leu Gly 
1250 1255 1260 acc ttg gca gct gat gca aac ttc atc atg gca aca ggc cag gcc 6115 Thr Leu Ala Ala Asp Ala Asn Phe Ile Met Ala Thr Gly Gln Ala 1265 1270 1275 ctg gca gat gcc tgc caa atg gaa cca gag gaa gtg gag atc atg 6160 Leu Ala Asp Ala Cys Gln Met Glu Pro Glu Glu Val Glu Ile Met 1280 1285 1290 gca aca gag cta ctg aaa gga cga gag gcc cca gag ggc atg gcc 6205 Ala Thr Glu Leu Leu Lys Gly Arg Glu Ala Pro Glu Gly Met Ala 1295 1300 1305 agc tcc ctg gga tgc ctg aac ctc ggg tcc tcc ctg ggc agc ctc 6250 Ser Ser Leu Gly Cys Leu Asn Leu Gly Ser Ser Leu Gly Ser Leu 1310 1315 1320 gac caa cac cag ggc tcc cag gag acc ctt att cct cca agg ctg 6295 Asp Gln His Gln Gly Ser Gln Glu Thr Leu Ile Pro Pro Arg Leu 1325 1330 1335 tga tgcccacaca gcatcagcat gggcttagag ctggcatgac caatgggggt 6348 ggggaagttg ctggggtgga gaagggctag cccaccgcag cagcctccct ccctctcagc 6408 agctagatgc atggcctgag gcagggtggt caggaaccac ctcaaaaagt gcggaggaag 6468 tagctggaca ggccctgccc ctcaccagca agaggcatga ttggatggag cttctaatgt 6528 cattcaaaaa ggcctggtca gtgcctgtct ggcctagggc cactcccacc tgcaggacat 6588 taaaatctcc aggcctgtga cactggc 6615 10 1338 PRT Homo sapiens misc_feature (36)..(36) The ′Xaa′ at location 36 stands for Lys, Arg, Thr, or Ile. 
10 Met Ser Lys Gly Leu Pro Asp Lys Ser Glu Glu Glu Lys Ser Thr Met 1 5 10 15 Ala Lys Lys Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr 20 25 30 Ala Lys Leu Xaa Ile Asp Glu Xaa Glu Ser Asn Val Asn Glu Val Lys 35 40 45 Asp Pro Tyr Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp Glu 50 55 60 Pro Glu Ile Pro Leu Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln 65 70 75 80 Leu Lys Glu Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile 85 90 95 Phe Ser Pro Thr Asn Lys Ile Arg Val Leu Cys His Arg Ile Val Asn 100 105 110 Ala Thr Trp Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser 115 120 125 Ala Ala Leu Ala Ala Glu Asp Pro Ile Arg Ala Asp Ser Met Arg Asn 130 135 140 Gln Ile Leu Lys His Phe Asp Ile Gly Phe Thr Ser Val Phe Thr Val 145 150 155 160 Glu Ile Val Leu Lys Met Thr Thr Tyr Gly Ala Phe Leu His Lys Gly 165 170 175 Ser Phe Cys Arg Asn Tyr Phe Asn Met Leu Asp Leu Leu Val Val Ala 180 185 190 Val Ser Leu Ile Ser Met Gly Leu Glu Ser Ser Ala Ile Ser Val Val 195 200 205 Lys Ile Leu Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn 210 215 220 Arg Ala Lys Gly Leu Lys Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 225 230 235 240 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 245 250 255 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 260 265 270 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 275 280 285 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 290 295 300 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 305 310 315 320 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys His Arg Gly Tyr 325 330 335 Tyr Tyr Val Tyr Lys Asp Gly Asp Pro Met Gln Ile Glu Leu Arg His 340 345 350 Arg Glu Trp Val His Ser Asp Phe His Phe Asp Asn Val Leu Ser Ala 355 360 365 Met Met Ser Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu 370 375 380 Leu Tyr Lys Ala Ile Asp Ser Asn Ala Glu Asp Val Gly Pro Ile Tyr 385 390 395 400 Asn Asn Arg Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu 405 410 415 Ile Ala Phe Phe 
Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr 420 425 430 Phe Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 435 440 445 Asn Gln Arg Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu Arg 450 455 460 Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr Ile Val 465 470 475 480 Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile Met Leu Asn 485 490 495 Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser Glu Gln Met Asn 500 505 510 His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr Ile Ile Phe Thr Leu 515 520 525 Glu Met Ile Leu Lys Leu Met Ala Phe Lys Ala Arg Gly Tyr Phe Gly 530 535 540 Asp Pro Trp Asn Val Phe Asp Phe Leu Ile Val Ile Gly Ser Ile Ile 545 550 555 560 Asp Val Ile Leu Ser Glu Ile Asp Thr Phe Leu Ala Ser Ser Gly Gly 565 570 575 Leu Tyr Cys Leu Gly Gly Gly Cys Gly Asn Val Asp Pro Asp Glu Ser 580 585 590 Ala Arg Ile Ser Ser Ala Phe Phe Arg Leu Phe Arg Val Met Arg Leu 595 600 605 Ile Lys Leu Leu Ser Arg Ala Glu Gly Val Arg Thr Leu Leu Trp Thr 610 615 620 Phe Ile Lys Ser Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu Ile Val 625 630 635 640 Met Leu Phe Phe Ile Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys 645 650 655 Ile Ala Leu Val Asp Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln 660 665 670 Thr Phe Pro Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu 675 680 685 Ala Trp Gln Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp 690 695 700 Pro Glu Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn 705 710 715 720 Phe Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu 725 730 735 Val Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr Leu 740 745 750 Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu Phe Lys 755 760 765 Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg Ile Lys His 770 775 780 Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro Pro Leu Gly Phe 785 790 795 800 Gly Lys Phe Cys Pro His Arg Val Ala Cys Lys Arg Leu Val Gly Met 805 810 815 Asn Met Pro Leu Asn Ser Asp Gly Thr Val Thr Phe Asn Ala Thr Leu 820 825 830 Phe Ala Leu Val Arg 
Thr Ala Leu Lys Ile Lys Thr Glu Gly Asn Phe 835 840 845 Glu Gln Ala Asn Glu Glu Leu Arg Ala Ile Ile Lys Lys Ile Trp Lys 850 855 860 Arg Thr Ser Met Lys Leu Leu Asp Gln Val Ile Pro Pro Ile Gly Asp 865 870 875 880 Asp Glu Val Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu 885 890 895 His Phe Arg Lys Phe Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg 900 905 910 Pro Lys Lys Asp Ile Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu 915 920 925 Glu Glu Ala Ala Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu Ala 930 935 940 Ala Glu Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu 945 950 955 960 Gly Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Asn Phe 965 970 975 Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg Pro 980 985 990 Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met Glu Ser Pro Val Phe 995 1000 1005 Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr Asn Pro Leu Ala Arg 1010 1015 1020 Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Gly Asn Ser 1025 1030 1035 Asn His Ser Asn Ser His Val Phe Ser Ser Val His Tyr Glu Arg 1040 1045 1050 Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala Thr Arg Gly Arg Ala 1055 1060 1065 Leu Gly Gln Pro Cys Arg Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1070 1075 1080 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1085 1090 1095 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1100 1105 1110 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1115 1120 1125 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1130 1135 1140 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1145 1150 1155 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1160 1165 1170 Xaa Xaa Thr Gly Pro His Ser Lys Pro Cys Val Glu Met Leu Lys 1175 1180 1185 Gly Leu Leu Thr Gln Arg Ala Met Pro Arg Gly Gln Ala Pro Pro 1190 1195 1200 Ala Pro Cys Gln Cys Pro Arg Val Glu Ser Ser Met Pro Glu Asp 1205 1210 1215 Arg Lys Ser Ser Thr Pro Gly Ser Leu His Glu Glu Thr Pro His 1220 1225 1230 Ser Arg Ser Thr Arg Glu Asn Thr Ser Arg Cys Ser 
Ala Pro Ala 1235 1240 1245 Thr Ala Leu Leu Ile Gln Lys Ala Leu Val Arg Gly Gly Leu Gly 1250 1255 1260 Thr Leu Ala Ala Asp Ala Asn Phe Ile Met Ala Thr Gly Gln Ala 1265 1270 1275 Leu Ala Asp Ala Cys Gln Met Glu Pro Glu Glu Val Glu Ile Met 1280 1285 1290 Ala Thr Glu Leu Leu Lys Gly Arg Glu Ala Pro Glu Gly Met Ala 1295 1300 1305 Ser Ser Leu Gly Cys Leu Asn Leu Gly Ser Ser Leu Gly Ser Leu 1310 1315 1320 Asp Gln His Gln Gly Ser Gln Glu Thr Leu Ile Pro Pro Arg Leu 1325 1330 1335 11 6160 DNA Homo sapiens gene (1)..(6160) 11 tcaggccggc agcggggagc cgagtggagg ctaattttac ttgctgggag cgaggagagt 60 aatcctcctg ccccccactc ctgccccgcc ccctggctgg ctcagcaggg cagctcagcc 120 gacagcctca gccagcctag tccccaaggc gggggcattg gggacacagg gaagggaaag 180 cactggggtg ggggagcagg agaaagccag attcccaggg aagcc atg gag cca tcc 237 Met Glu Pro Ser 1 tca ccc cag gat gaa ggc ctg agg aag aaa cag ccc aag aag cca gtt 285 Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro Lys Lys Pro Val 5 10 15 20 cct gag att ctg cca agg cca ccc cgg gct ttg ttc tgc ctg acc ctg 333 Pro Glu Ile Leu Pro Arg Pro Pro Arg Ala Leu Phe Cys Leu Thr Leu 25 30 35 gag aac ccc ctg agg aag gcc tgc atc agc att gta gaa tgg aag ccc 381 Glu Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val Glu Trp Lys Pro 40 45 50 ttc gag acg atc atc ttg ctc acc atc ttt gcc aat tgt gtg gcc ctg 429 Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn Cys Val Ala Leu 55 60 65 gcc gtg tac ctg ccc atg ccg gaa gat gac aac aac tct ctg aac ctc 477 Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn Ser Leu Asn Leu 70 75 80 ggc ctg gag aag ctg gag tat ttc ttc ctc att gtc ttc tcg att gaa 525 Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Ile Val Phe Ser Ile Glu 85 90 95 100 gcc gcc atg aag atc att gcc tac ggc ttc tta ttc cac cag gac gct 573 Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe His Gln Asp Ala 105 110 115 tac ctg cgc agt ggc tgg aat gtg ctg gac ttc acc att gtc ttc ctg 621 Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Thr Ile Val Phe Leu 120 125 130 ggg gtc ttc acc gtg att ctg gaa cag gtt 
aac gtc atc caa agc cac 669 Gly Val Phe Thr Val Ile Leu Glu Gln Val Asn Val Ile Gln Ser His 135 140 145 aca gcc cca atg agc agc aaa gga gcc ggc ttg gat gtc aag gcc ctc 717 Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp Val Lys Ala Leu 150 155 160 aga gcc ttc cga gtg ctc aga ccc ctc cgg ctg gtg tcg ggg gtg cct 765 Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro 165 170 175 180 agc ctg cag gtg gtc ctg aac tcc atc ttc aag gcc atg ctc ccc ctc 813 Ser Leu Gln Val Val Leu Asn Ser Ile Phe Lys Ala Met Leu Pro Leu 185 190 195 ttt cac atc gcc ctg ctg gtc ctc ttt atg gtc atc atc tat gcc atc 861 Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile Ile Tyr Ala Ile 200 205 210 atc ggg ctg gag ctc ttc aag ggc aag atg cac aag acc tgc tac ttc 909 Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys Thr Cys Tyr Phe 215 220 225 att ggt aca gat atc gtg gcc acg gtg gag aat gaa gag cca tcg ccc 957 Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu Glu Pro Ser Pro 230 235 240 tgc gcc agg acg ggc tca ggg cgc cgg tgc acc atc aat ggc agt gag 1005 Cys Ala Arg Thr Gly Ser Gly Arg Arg Cys Thr Ile Asn Gly Ser Glu 245 250 255 260 tgc cgg ggc ggc tgc cca ggg ccc aac cat ggc atc acc cac ttc gac 1053 Cys Arg Gly Gly Cys Pro Gly Pro Asn His Gly Ile Thr His Phe Asp 265 270 275 aac ttc ggc ttc tcc atg ctc acc gtg tac cag tgc att acc atg gag 1101 Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys Ile Thr Met Glu 280 285 290 gga tgg act gac gtc ctt tac tgg gtc aat gat gcc atc ggg aat gag 1149 Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Asn Glu 295 300 305 tgg ccc tgg atc tat ttt gtc acc ctc att ttg ctg gga tcc ttc ttc 1197 Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu Gly Ser Phe Phe 310 315 320 atc ctc aac ctg gtg ctg ggt gtc ctg agt ggg gaa ttc acc aag gag 1245 Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Thr Lys Glu 325 330 335 340 cgg gag aag gcc aag tcc agg gga acc ttc cag aag ctc cgg gag aag 1293 Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys Leu Arg Glu Lys 345 350 355 cag 
caa cta gat gag gac ctt cgg ggc tac atg agc tgg atc acg cag 1341 Gln Gln Leu Asp Glu Asp Leu Arg Gly Tyr Met Ser Trp Ile Thr Gln 360 365 370 ggc gag gtc atg gat gtt gag gac ttc aga gaa gga aaa ctg tct ttg 1389 Gly Glu Val Met Asp Val Glu Asp Phe Arg Glu Gly Lys Leu Ser Leu 375 380 385 gat gaa ggt ggc tct gac aca gag agc ctg tat gaa att gca ggc ttg 1437 Asp Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu Ile Ala Gly Leu 390 395 400 aac aaa atc atc cag ttc atc cga cat tgg agg cag tgg aac cgc atc 1485 Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln Trp Asn Arg Ile 405 410 415 420 ttt cgc tgg aag tgc cat gac atc gtg aag tcc aag gtc ttc tat tgg 1533 Phe Arg Trp Lys Cys His Asp Ile Val Lys Ser Lys Val Phe Tyr Trp 425 430 435 ctg gtg att ctc atc gtt gcc ctc aac acc ctg tct atc gcc tca gag 1581 Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser Ile Ala Ser Glu 440 445 450 cac cac aac cag ccg cac tgg ctg acc cgt ttg caa gac att gcc aac 1629 His His Asn Gln Pro His Trp Leu Thr Arg Leu Gln Asp Ile Ala Asn 455 460 465 cgg gtg ctg ctg tcc ctc ttc acc act gag atg ctg atg aag atg tac 1677 Arg Val Leu Leu Ser Leu Phe Thr Thr Glu Met Leu Met Lys Met Tyr 470 475 480 ggg ctg ggc ctg cgc cag tac ttc atg tct atc ttc aac cgc ttc gac 1725 Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe Asn Arg Phe Asp 485 490 495 500 tgc ttc gtg gtg tgc agc ggt atc ctg gag atc ctg ctg gtg gag tcg 1773 Cys Phe Val Val Cys Ser Gly Ile Leu Glu Ile Leu Leu Val Glu Ser 505 510 515 ggc gcc atg aca ccc ctg ggc atc tcc gtg ctc cgc tgc atc cgc ctc 1821 Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg Cys Ile Arg Leu 520 525 530 ctg agg atc ttc aag atc acc aaa tat tgg acg tcg ctg agc aac ctg 1869 Leu Arg Ile Phe Lys Ile Thr Lys Tyr Trp Thr Ser Leu Ser Asn Leu 535 540 545 gtg gca tcc ctg ctc aac tcc atc cgc tcc atc gcc tcc ctg ctg ctg 1917 Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala Ser Leu Leu Leu 550 555 560 ctg ctc ttc ctc ttc atc gtc atc ttc cgc ctc ctg ggc atg cag ctc 1965 Leu Leu Phe Leu Phe Ile Val Ile Phe Arg 
Leu Leu Gly Met Gln Leu 565 570 575 580 ttt ggg ggg agg tat gac ttt gaa gac aca gaa gta cgg cgc agc aac 2013 Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val Arg Arg Ser Asn 585 590 595 ttt gac aac ttt ccc caa gcc ctc atc agc gtc ttc cag gta ctg aca 2061 Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe Gln Val Leu Thr 600 605 610 ggg gaa gac tgg acc tca atg atg tac aat ggg atc atg gcc tcg agc 2109 Gly Glu Asp Trp Thr Ser Met Met Tyr Asn Gly Ile Met Ala Ser Ser 615 620 625 ggg ccg tcc tac cct ggc atg ctt gtg tgc att tac ttc atc atc ctt 2157 Gly Pro Ser Tyr Pro Gly Met Leu Val Cys Ile Tyr Phe Ile Ile Leu 630 635 640 ttc gtc tgt ggc aac tac atc ctg ctc aat gtc ttc ctg gcc att gcc 2205 Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu Ala Ile Ala 645 650 655 660 gtg gac aac ctg gcc gag gcg gag agc ctg act tct gcc cag aag gcc 2253 Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser Ala Gln Lys Ala 665 670 675 aag gct gag gag aaa aaa cgc agg aag atg tcc aag ggt ctc cca gac 2301 Lys Ala Glu Glu Lys Lys Arg Arg Lys Met Ser Lys Gly Leu Pro Asp 680 685 690 aag tca gaa gag gag aag tca acg atg gcc aag aag ctg gag cag aaa 2349 Lys Ser Glu Glu Glu Lys Ser Thr Met Ala Lys Lys Leu Glu Gln Lys 695 700 705 ccc aag ggt gag ggc atc ccc acc act gcc aag ctg aaa atc gat gag 2397 Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu Lys Ile Asp Glu 710 715 720 ttt gaa tct aat gtc aat gag gtg aag gat ccc tac ccc tca gcc gac 2445 Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr Pro Ser Ala Asp 725 730 735 740 ttc cca ggg gat gac gag gaa gat gag cct gag atc ccg ctg agc ccc 2493 Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile Pro Leu Ser Pro 745 750 755 cga cca cgt ccc ctg gct gag ctg cag ctg aaa gag aag gcc gtg ccc 2541 Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu Lys Ala Val Pro 760 765 770 att cca gaa gcc agc tcc ttc ttc atc ttc agc ccc acc aat aag atc 2589 Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro Thr Asn Lys Ile 775 780 785 cgt gtc ctg tgt cac cgc atc gtc aat gcc acc tgg ttc acc aac ttc 
2637 Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp Phe Thr Asn Phe 790 795 800 atc ctg ctc ttc atc ctg ctc agc agc gct gca ctg gct gcg gaa gac 2685 Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp 805 810 815 820 ccc atc cgg gct gat tcc atg aga aat cag atc ctt aaa cac ttt gac 2733 Pro Ile Arg Ala Asp Ser Met Arg Asn Gln Ile Leu Lys His Phe Asp 825 830 835 atc ggg ttc acc tct gtc ttc act gtg gag att gtc ctc aag atg acg 2781 Ile Gly Phe Thr Ser Val Phe Thr Val Glu Ile Val Leu Lys Met Thr 840 845 850 acc tac gga gcc ttc ctg cac aag ggt tcc ttc tgc cgc aat tac ttc 2829 Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys Arg Asn Tyr Phe 855 860 865 aac atg ctg gac ctg ctg gtg gtg gcc gtg tcc ctc atc tcc atg gga 2877 Asn Met Leu Asp Leu Leu Val Val Ala Val Ser Leu Ile Ser Met Gly 870 875 880 ctt gag tcc agt gcc atc tcc gtg gtg aag atc ctg agg gtg ctg agg 2925 Leu Glu Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg 885 890 895 900 gtg ctc cga cca ctc aga gcc atc aac aga gcc aag ggg ttg aag cac 2973 Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 905 910 915 gtg gct agg tgc atg ttc gtg gcc atc agc acc atc ggg aac atc gtg 3021 Val Ala Arg Cys Met Phe Val Ala Ile Ser Thr Ile Gly Asn Ile Val 920 925 930 ctg gtc act acc ctc cta cag ttc atg ttt gcc tgc atc ggc gtc cag 3069 Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly Val Gln 935 940 945 ctc ttc aag ggg aag ttc ttc agg tgc acc gac ttg tcc aag atg aca 3117 Leu Phe Lys Gly Lys Phe Phe Arg Cys Thr Asp Leu Ser Lys Met Thr 950 955 960 gag gag gag tgc agg ggc tac tac tac gtg tac aag gac ggg gac ccc 3165 Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys Asp Gly Asp Pro 965 970 975 980 atg cag ata gag ctg cgt cac cgc gag tgg gta cac agc gac ttc cac 3213 Met Gln Ile Glu Leu Arg His Arg Glu Trp Val His Ser Asp Phe His 985 990 995 ttc gac aat gtg ctc tca gcc atg atg tcc ctc ttc acg gtc tcc 3258 Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe Thr Val Ser 1000 1005 1010 acc ttc gag gga tgg cct 
cag ctg ctg tac aag gcc ata gac tcc 3303 Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Lys Ala Ile Asp Ser 1015 1020 1025 aat gcg gag gac gtg ggt ccc atc tac aac aac cgt gtg gag atg 3348 Asn Ala Glu Asp Val Gly Pro Ile Tyr Asn Asn Arg Val Glu Met 1030 1035 1040 gcc atc ttc ttc atc atc tac atc atc ctc att gcc ttc ttc atg 3393 Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala Phe Phe Met 1045 1050 1055 atg aac atc ttt gtg ggc ttc gtc att gtc acc ttc cag gag cag 3438 Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe Gln Glu Gln 1060 1065 1070 gga gag act gag tac aag aac tgt gag ctg gac aag aac cag cgc 3483 Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg 1075 1080 1085 caa tgt gta cag tat gcc ctg aag gcc cgc cca ctg agg tgc tac 3528 Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu Arg Cys Tyr 1090 1095 1100 att ccc aaa aac cca tac cag tac cag gtg tgg tac att gtc acc 3573 Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr Ile Val Thr 1105 1110 1115 tcc tcc tac ttt gaa tac ctg atg ttt gcc ctc atc atg ctc aac 3618 Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile Met Leu Asn 1120 1125 1130 acc atc tgc ctc ggc atg cag cac tac aac cag tcg gag cag atg 3663 Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser Glu Gln Met 1135 1140 1145 aac cac atc tca gac atc ctc aat gtg gcc ttc act atc atc ttc 3708 Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr Ile Ile Phe 1150 1155 1160 acc ctg gag atg atc ctc aag ctc atg gcc ttc aag gcc agg ggc 3753 Thr Leu Glu Met Ile Leu Lys Leu Met Ala Phe Lys Ala Arg Gly 1165 1170 1175 tac ttt gga aac ccc tgg aat gtg ttt gac ttc ctg att gtc att 3798 Tyr Phe Gly Asn Pro Trp Asn Val Phe Asp Phe Leu Ile Val Ile 1180 1185 1190 ggc agc atc att gat gtc atc ctc agt gag atc gac act ttc ctg 3843 Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp Thr Phe Leu 1195 1200 1205 gcc tcc agc ggg gga ctg tat tgc ctg ggt gga ggc tgc ggg aac 3888 Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly Cys Gly Asn 1210 1215 1220 gtt gac cca gat gag agt gcc cgc atc tcc agc gcc ttc ttc cgc 
3933 Val Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala Phe Phe Arg 1225 1230 1235 ctg ttc cgt gtc atg agg ctg atc aag ctg ctg agc cgg gca gaa 3978 Leu Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser Arg Ala Glu 1240 1245 1250 gga gtg cga acc ctc ctg tgg acg ttc atc aag tcc ttc cag gcc 4023 Gly Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala 1255 1260 1265 cta ccc tac gtg gct ctg ctc atc gtc atg ctc ttc ttc atc tac 4068 Leu Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe Phe Ile Tyr 1270 1275 1280 gct gtc atc ggc atg cag atg ttt ggg aag atc gcc ttg gtg gat 4113 Ala Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala Leu Val Asp 1285 1290 1295 ggg acc caa ata aac cgg aac aac aac ttc cag acc ttc cca caa 4158 Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln 1300 1305 1310 gct gtg cta ctg ctc ttc agg tgt gca aca ggt gag gcc tgg cag 4203 Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp Gln 1315 1320 1325 gag atc cta ctg gcc tgc agc tat ggg aag ctg tgt gac cca gag 4248 Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp Pro Glu 1330 1335 1340 tcg gac tat gcc cca ggg gag gag tac aca tgt ggc acc aac ttt 4293 Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn Phe 1345 1350 1355 gca tac tac tac ttc atc agc ttc tac atg ctc tgt gcc ttc ctg 4338 Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu 1360 1365 1370 gtc atc aac ctc ttt gtg gct gtc atc atg gac aat ttt gac tac 4383 Val Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr 1375 1380 1385 ctc acc cgg gac tgg tcc atc ctg ggc cct cat cac ctg gat gag 4428 Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu 1390 1395 1400 ttc aag gcc atc tgg gca gag tat gac cca gag gct aag ggg agg 4473 Phe Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg 1405 1410 1415 atc aaa cac ctg gac gtg gtg acc ctg ctg aga agg att cag ccc 4518 Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro 1420 1425 1430 cct ctg ggc ttt ggg aag ttc tgc cca cat cgg gta gct tgt aag 4563 Pro Leu Gly Phe Gly Lys Phe Cys 
Pro His Arg Val Ala Cys Lys 1435 1440 1445 cgg ctg gtg ggc atg aac atg ccc ctg aac agc gac ggc aca gtc 4608 Arg Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val 1450 1455 1460 acc ttc aat gcc aca ctc ttt gcc ctg gtc cgc acg gca ctc aag 4653 Thr Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys 1465 1470 1475 atc aag acg gaa ggt aac ttt gag cag gcc aac gag gag ctg agg 4698 Ile Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu Glu Leu Arg 1480 1485 1490 gcc atc atc aag aag atc tgg aag aga acc agc atg aag ctc ttg 4743 Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met Lys Leu Leu 1495 1500 1505 gac cag gtc atc cct cca ata gga gat gat gag gtg aca gtg ggg 4788 Asp Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val Thr Val Gly 1510 1515 1520 aag ttc tac gcc aca ttc ctc atc cag gag cac ttc cgg aag ttc 4833 Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe Arg Lys Phe 1525 1530 1535 atg aaa cgc caa gag gag tat tat ggc tat cgg ccc aag aag gac 4878 Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro Lys Lys Asp 1540 1545 1550 att gta cag atc cag gca ggg ctg cgg acc att gag gaa gag gca 4923 Ile Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu Glu Glu Ala 1555 1560 1565 gcc ccc gag atc tgt cgc acg gtc tca gga gac ctg gct gct gag 4968 Ala Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu Ala Ala Glu 1570 1575 1580 gag gag ctg gag aga gcc atg gtg gag gct gcg atg gag gag ggg 5013 Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu Gly 1585 1590 1595 ata ttc cgg agg act gga ggc ctg ttt ggc cag gtg gac aac ttc 5058 Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Asn Phe 1600 1605 1610 ctg gaa agg acc aac tcc ctg ccc cct gtc atg gcc aat cag aga 5103 Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg 1615 1620 1625 ccc ctc cag ttt gct gag ata gag atg gaa gag atg gag tca cct 5148 Pro Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met Glu Ser Pro 1630 1635 1640 gtc ttc ttg gag gac ttc cca caa gat cca cgc acc aac ccc ctg 5193 Val Phe Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr Asn Pro Leu 1645 1650 
1655 gct cgt gcc aat acc aac aat gcc aac gcc aat gtc gcc tat gcg 5238 Ala Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Ala 1660 1665 1670 aac agc aac cat agc aac agc cat gtg ttt tcc agt gtc cac tat 5283 Asn Ser Asn His Ser Asn Ser His Val Phe Ser Ser Val His Tyr 1675 1680 1685 gaa agg gag ttc cca gaa gag aca gag acg cct gct acc aga gga 5328 Glu Arg Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala Thr Arg Gly 1690 1695 1700 cga gcc ctt ggc caa ccc tgc agg tcc ctg gga ccc cac agc aaa 5373 Arg Ala Leu Gly Gln Pro Cys Arg Ser Leu Gly Pro His Ser Lys 1705 1710 1715 ccc tgt gtg gag atg ctg aag gga ctg ctg acc cag agg gca atg 5418 Pro Cys Val Glu Met Leu Lys Gly Leu Leu Thr Gln Arg Ala Met 1720 1725 1730 ccc aga ggc cag gca cct cct gcc ccc tgc cag tgc ccc agg gtg 5463 Pro Arg Gly Gln Ala Pro Pro Ala Pro Cys Gln Cys Pro Arg Val 1735 1740 1745 gag tcc tcc atg cct gag gac aga aag agc tcc aca cca ggg tct 5508 Glu Ser Ser Met Pro Glu Asp Arg Lys Ser Ser Thr Pro Gly Ser 1750 1755 1760 ctt cat gag gag aca ccc cac agc agg agc acc agg gag aat act 5553 Leu His Glu Glu Thr Pro His Ser Arg Ser Thr Arg Glu Asn Thr 1765 1770 1775 tcc agg tgc tca gca cca gct aca gcc ctg ctg atc caa aag gct 5598 Ser Arg Cys Ser Ala Pro Ala Thr Ala Leu Leu Ile Gln Lys Ala 1780 1785 1790 ctg gtt cga ggg ggc ctg ggc acc ttg gca gct gat gca aac ttc 5643 Leu Val Arg Gly Gly Leu Gly Thr Leu Ala Ala Asp Ala Asn Phe 1795 1800 1805 atc atg gca aca ggc cag gcc ctc gga gat gcc tgc caa atg gaa 5688 Ile Met Ala Thr Gly Gln Ala Leu Gly Asp Ala Cys Gln Met Glu 1810 1815 1820 cca gag gaa gtg gag atc atg gca aca gag cta ctg aaa gga cga 5733 Pro Glu Glu Val Glu Ile Met Ala Thr Glu Leu Leu Lys Gly Arg 1825 1830 1835 gag gcc cca gac ggc atg gcc agc tcc ctg gga tgc ctg aac ctc 5778 Glu Ala Pro Asp Gly Met Ala Ser Ser Leu Gly Cys Leu Asn Leu 1840 1845 1850 ggg tcc tcc ctg ggc agc ctc gac caa cac cag ggc tcc cag gag 5823 Gly Ser Ser Leu Gly Ser Leu Asp Gln His Gln Gly Ser Gln Glu 1855 1860 1865 acc ctt att cct cca agg ctg tga 
tgcccacaca gcatcagcat gggcttagag 5877 Thr Leu Ile Pro Pro Arg Leu 1870 ctggcatgac caatgggggt ggggaagttg ctggggtgga gaagggctag cccaccgcag 5937 cagcctccct ccctctcagc agctagatgc atgcctgagg cagggtggtc aggaaccacc 5997 tcaaaaagtg cggaggaagt agctggacag gccctgcccc tcaccagcaa gaggcatgat 6057 tggatggagc ttctaatgtc attcaaaaag gcctggtcag tgcctgtccc tagggccact 6117 cccacctgca ggacattaaa atctccaggc ctgtgacact ggc 6160 12 1873 PRT Homo sapiens 12 Met Glu Pro Ser Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro 1 5 10 15 Lys Lys Pro Val Pro Glu Ile Leu Pro Arg Pro Pro Arg Ala Leu Phe 20 25 30 Cys Leu Thr Leu Glu Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val 35 40 45 Glu Trp Lys Pro Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn 50 55 60 Cys Val Ala Leu Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn 65 70 75 80 Ser Leu Asn Leu Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Ile Val 85 90 95 Phe Ser Ile Glu Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe 100 105 110 His Gln Asp Ala Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Thr 115 120 125 Ile Val Phe Leu Gly Val Phe Thr Val Ile Leu Glu Gln Val Asn Val 130 135 140 Ile Gln Ser His Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp 145 150 155 160 Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val 165 170 175 Ser Gly Val Pro Ser Leu Gln Val Val Leu Asn Ser Ile Phe Lys Ala 180 185 190 Met Leu Pro Leu Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile 195 200 205 Ile Tyr Ala Ile Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys 210 215 220 Thr Cys Tyr Phe Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu 225 230 235 240 Glu Pro Ser Pro Cys Ala Arg Thr Gly Ser Gly Arg Arg Cys Thr Ile 245 250 255 Asn Gly Ser Glu Cys Arg Gly Gly Cys Pro Gly Pro Asn His Gly Ile 260 265 270 Thr His Phe Asp Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys 275 280 285 Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala 290 295 300 Ile Gly Asn Glu Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu 305 310 315 320 Gly Ser Phe Phe Ile Leu Asn Leu Val Leu Gly 
Val Leu Ser Gly Glu 325 330 335 Phe Thr Lys Glu Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys 340 345 350 Leu Arg Glu Lys Gln Gln Leu Asp Glu Asp Leu Arg Gly Tyr Met Ser 355 360 365 Trp Ile Thr Gln Gly Glu Val Met Asp Val Glu Asp Phe Arg Glu Gly 370 375 380 Lys Leu Ser Leu Asp Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu 385 390 395 400 Ile Ala Gly Leu Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln 405 410 415 Trp Asn Arg Ile Phe Arg Trp Lys Cys His Asp Ile Val Lys Ser Lys 420 425 430 Val Phe Tyr Trp Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser 435 440 445 Ile Ala Ser Glu His His Asn Gln Pro His Trp Leu Thr Arg Leu Gln 450 455 460 Asp Ile Ala Asn Arg Val Leu Leu Ser Leu Phe Thr Thr Glu Met Leu 465 470 475 480 Met Lys Met Tyr Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe 485 490 495 Asn Arg Phe Asp Cys Phe Val Val Cys Ser Gly Ile Leu Glu Ile Leu 500 505 510 Leu Val Glu Ser Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg 515 520 525 Cys Ile Arg Leu Leu Arg Ile Phe Lys Ile Thr Lys Tyr Trp Thr Ser 530 535 540 Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala 545 550 555 560 Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Val Ile Phe Arg Leu Leu 565 570 575 Gly Met Gln Leu Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val 580 585 590 Arg Arg Ser Asn Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe 595 600 605 Gln Val Leu Thr Gly Glu Asp Trp Thr Ser Met Met Tyr Asn Gly Ile 610 615 620 Met Ala Ser Ser Gly Pro Ser Tyr Pro Gly Met Leu Val Cys Ile Tyr 625 630 635 640 Phe Ile Ile Leu Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe 645 650 655 Leu Ala Ile Ala Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser 660 665 670 Ala Gln Lys Ala Lys Ala Glu Glu Lys Lys Arg Arg Lys Met Ser Lys 675 680 685 Gly Leu Pro Asp Lys Ser Glu Glu Glu Lys Ser Thr Met Ala Lys Lys 690 695 700 Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu 705 710 715 720 Lys Ile Asp Glu Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr 725 730 735 Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp 
Glu Pro Glu Ile 740 745 750 Pro Leu Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu 755 760 765 Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro 770 775 780 Thr Asn Lys Ile Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp 785 790 795 800 Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu 805 810 815 Ala Ala Glu Asp Pro Ile Arg Ala Asp Ser Met Arg Asn Gln Ile Leu 820 825 830 Lys His Phe Asp Ile Gly Phe Thr Ser Val Phe Thr Val Glu Ile Val 835 840 845 Leu Lys Met Thr Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys 850 855 860 Arg Asn Tyr Phe Asn Met Leu Asp Leu Leu Val Val Ala Val Ser Leu 865 870 875 880 Ile Ser Met Gly Leu Glu Ser Ser Ala Ile Ser Val Val Lys Ile Leu 885 890 895 Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys 900 905 910 Gly Leu Lys His Val Ala Arg Cys Met Phe Val Ala Ile Ser Thr Ile 915 920 925 Gly Asn Ile Val Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys 930 935 940 Ile Gly Val Gln Leu Phe Lys Gly Lys Phe Phe Arg Cys Thr Asp Leu 945 950 955 960 Ser Lys Met Thr Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys 965 970 975 Asp Gly Asp Pro Met Gln Ile Glu Leu Arg His Arg Glu Trp Val His 980 985 990 Ser Asp Phe His Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe 995 1000 1005 Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Lys Ala 1010 1015 1020 Ile Asp Ser Asn Ala Glu Asp Val Gly Pro Ile Tyr Asn Asn Arg 1025 1030 1035 Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala 1040 1045 1050 Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe 1055 1060 1065 Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 1070 1075 1080 Asn Gln Arg Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu 1085 1090 1095 Arg Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr 1100 1105 1110 Ile Val Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile 1115 1120 1125 Met Leu Asn Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser 1130 1135 1140 Glu Gln Met Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr 1145 
1150 1155 Ile Ile Phe Thr Leu Glu Met Ile Leu Lys Leu Met Ala Phe Lys 1160 1165 1170 Ala Arg Gly Tyr Phe Gly Asn Pro Trp Asn Val Phe Asp Phe Leu 1175 1180 1185 Ile Val Ile Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp 1190 1195 1200 Thr Phe Leu Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly 1205 1210 1215 Cys Gly Asn Val Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala 1220 1225 1230 Phe Phe Arg Leu Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser 1235 1240 1245 Arg Ala Glu Gly Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser 1250 1255 1260 Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe 1265 1270 1275 Phe Ile Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala 1280 1285 1290 Leu Val Asp Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr 1295 1300 1305 Phe Pro Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu 1310 1315 1320 Ala Trp Gln Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys 1325 1330 1335 Asp Pro Glu Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly 1340 1345 1350 Thr Asn Phe Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys 1355 1360 1365 Ala Phe Leu Val Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn 1370 1375 1380 Phe Asp Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His 1385 1390 1395 Leu Asp Glu Phe Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala 1400 1405 1410 Lys Gly Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg 1415 1420 1425 Ile Gln Pro Pro Leu Gly Phe Gly Lys Phe Cys Pro His Arg Val 1430 1435 1440 Ala Cys Lys Arg Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp 1445 1450 1455 Gly Thr Val Thr Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr 1460 1465 1470 Ala Leu Lys Ile Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu 1475 1480 1485 Glu Leu Arg Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met 1490 1495 1500 Lys Leu Leu Asp Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val 1505 1510 1515 Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe 1520 1525 1530 Arg Lys Phe Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro 1535 1540 1545 Lys Lys Asp Ile Val Gln Ile Gln Ala Gly 
Leu Arg Thr Ile Glu 1550 1555 1560 Glu Glu Ala Ala Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu 1565 1570 1575 Ala Ala Glu Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met 1580 1585 1590 Glu Glu Gly Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val 1595 1600 1605 Asp Asn Phe Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala 1610 1615 1620 Asn Gln Arg Pro Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met 1625 1630 1635 Glu Ser Pro Val Phe Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr 1640 1645 1650 Asn Pro Leu Ala Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val 1655 1660 1665 Ala Tyr Ala Asn Ser Asn His Ser Asn Ser His Val Phe Ser Ser 1670 1675 1680 Val His Tyr Glu Arg Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala 1685 1690 1695 Thr Arg Gly Arg Ala Leu Gly Gln Pro Cys Arg Ser Leu Gly Pro 1700 1705 1710 His Ser Lys Pro Cys Val Glu Met Leu Lys Gly Leu Leu Thr Gln 1715 1720 1725 Arg Ala Met Pro Arg Gly Gln Ala Pro Pro Ala Pro Cys Gln Cys 1730 1735 1740 Pro Arg Val Glu Ser Ser Met Pro Glu Asp Arg Lys Ser Ser Thr 1745 1750 1755 Pro Gly Ser Leu His Glu Glu Thr Pro His Ser Arg Ser Thr Arg 1760 1765 1770 Glu Asn Thr Ser Arg Cys Ser Ala Pro Ala Thr Ala Leu Leu Ile 1775 1780 1785 Gln Lys Ala Leu Val Arg Gly Gly Leu Gly Thr Leu Ala Ala Asp 1790 1795 1800 Ala Asn Phe Ile Met Ala Thr Gly Gln Ala Leu Gly Asp Ala Cys 1805 1810 1815 Gln Met Glu Pro Glu Glu Val Glu Ile Met Ala Thr Glu Leu Leu 1820 1825 1830 Lys Gly Arg Glu Ala Pro Asp Gly Met Ala Ser Ser Leu Gly Cys 1835 1840 1845 Leu Asn Leu Gly Ser Ser Leu Gly Ser Leu Asp Gln His Gln Gly 1850 1855 1860 Ser Gln Glu Thr Leu Ile Pro Pro Arg Leu 1865 1870 13 6160 DNA Homo sapiens gene (1)..(6160) 13 tcaggccggc agcggggagc cgagtggagg ctaattttac ttgctgggag cgaggagagt 60 aatcctcctg ccccccactc ctgccccgcc ccctggctgg ctcagcaggg cagctcagcc 120 gacagcctca gccagcctag tccccaaggc gggggcattg gggacacagg gaagggaaag 180 cactggggtg ggggagcagg agaaagccag attcccaggg aagcc atg gag cca tcc 237 Met Glu Pro Ser 1 tca ccc cag gat gaa ggc ctg agg aag aaa cag ccc aag aag cca gtt 285 Ser 
Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro Lys Lys Pro Val 5 10 15 20 cct gag att ctg cca agg cca ccc cgg gct ttg ttc tgc ctg acc ctg 333 Pro Glu Ile Leu Pro Arg Pro Pro Arg Ala Leu Phe Cys Leu Thr Leu 25 30 35 gag aac ccc ctg agg aag gcc tgc atc agc att gta gaa tgg aag ccc 381 Glu Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val Glu Trp Lys Pro 40 45 50 ttc gag acg atc atc ttg ctc acc atc ttt gcc aat tgt gtg gcc ctg 429 Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn Cys Val Ala Leu 55 60 65 gcc gtg tac ctg ccc atg ccg gaa gat gac aac aac tct ctg aac ctc 477 Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn Ser Leu Asn Leu 70 75 80 ggc ctg gag aag ctg gag tat ttc ttc ctc att gtc ttc tcg att gaa 525 Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Ile Val Phe Ser Ile Glu 85 90 95 100 gcc gcc atg aag atc att gcc tac ggc ttc tta ttc cac cag gac gct 573 Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe His Gln Asp Ala 105 110 115 tac ctg cgc agt ggc tgg aat gtg ctg gac ttc acc att gtc ttc ctg 621 Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Thr Ile Val Phe Leu 120 125 130 ggg gtc ttc acc gtg att ctg gaa cag gtt aac gtc atc caa agc cac 669 Gly Val Phe Thr Val Ile Leu Glu Gln Val Asn Val Ile Gln Ser His 135 140 145 aca gcc cca atg agc agc aaa gga gcc ggc ttg gat gtc aag gcc ctc 717 Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp Val Lys Ala Leu 150 155 160 aga gcc ttc cga gtg ctc aga ccc ctc cgg ctg gtg tcg ggg gtg cct 765 Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro 165 170 175 180 agc ctg cag gtg gtc ctg aac tcc atc ttc aag gcc atg ctc ccc ctc 813 Ser Leu Gln Val Val Leu Asn Ser Ile Phe Lys Ala Met Leu Pro Leu 185 190 195 ttt cac atc gcc ctg ctg gtc ctc ttt atg gtc atc atc tat gcc atc 861 Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile Ile Tyr Ala Ile 200 205 210 atc ggg ctg gag ctc ttc aag ggc aag atg cac aag acc tgc tac ttc 909 Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys Thr Cys Tyr Phe 215 220 225 att ggt aca gat atc gtg gcc acg gtg gag aat gaa gag cca tcg ccc 
957 Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu Glu Pro Ser Pro 230 235 240 tgc gcc agg acg ggc tca ggg cgc cgg tgc acc atc aat ggc agt gag 1005 Cys Ala Arg Thr Gly Ser Gly Arg Arg Cys Thr Ile Asn Gly Ser Glu 245 250 255 260 tgc cgg ggc ggc tgc cca ggg ccc aac cat ggc atc acc cac ttc gac 1053 Cys Arg Gly Gly Cys Pro Gly Pro Asn His Gly Ile Thr His Phe Asp 265 270 275 aac ttc ggc ttc tcc atg ctc acc gtg tac cag tgc att acc atg gag 1101 Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys Ile Thr Met Glu 280 285 290 gga tgg act gac gtc ctt tac tgg gtc aat gat gcc atc ggg aat gag 1149 Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Asn Glu 295 300 305 tgg ccc tgg atc tat ttt gtc acc ctc att ttg ctg gga tcc ttc ttc 1197 Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu Gly Ser Phe Phe 310 315 320 atc ctc aac ctg gtg ctg ggt gtc ctg agt ggg gaa ttc acc aag gag 1245 Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Thr Lys Glu 325 330 335 340 cgg gag aag gcc aag tcc agg gga acc ttc cag aag ctc cgg gag aag 1293 Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys Leu Arg Glu Lys 345 350 355 cag caa cta gat gag gac ctt cgg ggc tac atg agc tgg atc acg cag 1341 Gln Gln Leu Asp Glu Asp Leu Arg Gly Tyr Met Ser Trp Ile Thr Gln 360 365 370 ggc gag gtc atg gat gtt gag gac ttc aga gaa gga aaa ctg tct ttg 1389 Gly Glu Val Met Asp Val Glu Asp Phe Arg Glu Gly Lys Leu Ser Leu 375 380 385 gat gaa ggt ggc tct gac aca gag agc ctg tat gaa att gca ggc ttg 1437 Asp Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu Ile Ala Gly Leu 390 395 400 aac aaa atc atc cag ttc atc cga cat tgg agg cag tgg aac cgc atc 1485 Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln Trp Asn Arg Ile 405 410 415 420 ttt cgc tgg aag tgc cat gac atc gtg aag tcc aag gtc ttc tat tgg 1533 Phe Arg Trp Lys Cys His Asp Ile Val Lys Ser Lys Val Phe Tyr Trp 425 430 435 ctg gtg att ctc atc gtt gcc ctc aac acc ctg tct atc gcc tca gag 1581 Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser Ile Ala Ser Glu 440 445 450 cac cac aac cag ccg 
cac tgg ctg acc cgt ttg caa gac att gcc aac 1629 His His Asn Gln Pro His Trp Leu Thr Arg Leu Gln Asp Ile Ala Asn 455 460 465 cgg gtg ctg ctg tcc ctc ttc acc act gag atg ctg atg aag atg tac 1677 Arg Val Leu Leu Ser Leu Phe Thr Thr Glu Met Leu Met Lys Met Tyr 470 475 480 ggg ctg ggc ctg cgc cag tac ttc atg tct atc ttc aac cgc ttc gac 1725 Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe Asn Arg Phe Asp 485 490 495 500 tgc ttc gtg gtg tgc agc ggt atc ctg gag atc ctg ctg gtg gag tcg 1773 Cys Phe Val Val Cys Ser Gly Ile Leu Glu Ile Leu Leu Val Glu Ser 505 510 515 ggc gcc atg aca ccc ctg ggc atc tcc gtg ctc cgc tgc atc cgc ctc 1821 Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg Cys Ile Arg Leu 520 525 530 ctg agg atc ttc aag atc acc aaa tat tgg acg tcg ctg agc aac ctg 1869 Leu Arg Ile Phe Lys Ile Thr Lys Tyr Trp Thr Ser Leu Ser Asn Leu 535 540 545 gtg gca tcc ctg ctc aac tcc atc cgc tcc atc gcc tcc ctg ctg ctg 1917 Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala Ser Leu Leu Leu 550 555 560 ctg ctc ttc ctc ttc atc gtc atc ttc cgc ctc ctg ggc atg cag ctc 1965 Leu Leu Phe Leu Phe Ile Val Ile Phe Arg Leu Leu Gly Met Gln Leu 565 570 575 580 ttt ggg ggg agg tat gac ttt gaa gac aca gaa gta cgg cgc agc aac 2013 Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val Arg Arg Ser Asn 585 590 595 ttt gac aac ttt ccc caa gcc ctc atc agc gtc ttc cag gta ctg aca 2061 Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe Gln Val Leu Thr 600 605 610 ggg gaa gac tgg acc tca atg atg tac aat ggg atc atg gcc tcg agc 2109 Gly Glu Asp Trp Thr Ser Met Met Tyr Asn Gly Ile Met Ala Ser Ser 615 620 625 ggg ccg tcc tac cct ggc atg ctt gtg tgc att tac ttc atc atc ctt 2157 Gly Pro Ser Tyr Pro Gly Met Leu Val Cys Ile Tyr Phe Ile Ile Leu 630 635 640 ttc gtc tgt ggc aac tac atc ctg ctc aat gtc ttc ctg gcc att gcc 2205 Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu Ala Ile Ala 645 650 655 660 gtg gac aac ctg gcc gag gcg gag agc ctg act tct gcc cag aag gcc 2253 Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser Ala 
Gln Lys Ala 665 670 675 aag gct gag gag aaa aaa cgc agg aag atg tcc aag ggt ctc cca gac 2301 Lys Ala Glu Glu Lys Lys Arg Arg Lys Met Ser Lys Gly Leu Pro Asp 680 685 690 aag tca gaa gag gag aag tca acg atg gcc aag aag ctg gag cag aaa 2349 Lys Ser Glu Glu Glu Lys Ser Thr Met Ala Lys Lys Leu Glu Gln Lys 695 700 705 ccc aag ggt gag ggc atc ccc acc act gcc aag ctg aaa atc gat gag 2397 Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu Lys Ile Asp Glu 710 715 720 ttt gaa tct aat gtc aat gag gtg aag gat ccc tac ccc tca gcc gac 2445 Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr Pro Ser Ala Asp 725 730 735 740 ttc cca ggg gat gac gag gaa gat gag cct gag atc ccg ctg agc ccc 2493 Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile Pro Leu Ser Pro 745 750 755 cga cca cgt ccc ctg gct gag ctg cag ctg aaa gag aag gcc gtg ccc 2541 Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu Lys Ala Val Pro 760 765 770 att cca gaa gcc agc tcc ttc ttc atc ttc agc ccc acc aat aag atc 2589 Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro Thr Asn Lys Ile 775 780 785 cgt gtc ctg tgt cac cgc atc gtc aat gcc acc tgg ttc acc aac ttc 2637 Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp Phe Thr Asn Phe 790 795 800 atc ctg ctc ttc atc ctg ctc agc agc gct gca ctg gct gcg gaa gac 2685 Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp 805 810 815 820 ccc atc cgg gct gat tcc atg aga aat cag atc ctt aaa cac ttt gac 2733 Pro Ile Arg Ala Asp Ser Met Arg Asn Gln Ile Leu Lys His Phe Asp 825 830 835 atc ggg ttc acc tct gtc ttc act gtg gag att gtc ctc aag atg acg 2781 Ile Gly Phe Thr Ser Val Phe Thr Val Glu Ile Val Leu Lys Met Thr 840 845 850 acc tac gga gcc ttc ctg cac aag ggt tcc ttc tgc cgc aat tac ttc 2829 Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys Arg Asn Tyr Phe 855 860 865 aac atg ctg gac ctg ctg gtg gtg gcc gtg tcc ctc atc tcc atg gga 2877 Asn Met Leu Asp Leu Leu Val Val Ala Val Ser Leu Ile Ser Met Gly 870 875 880 ctt gag tcc agt gcc atc tcc gtg gtg aag atc ctg agg gtg ctg agg 2925 Leu Glu Ser 
Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg 885 890 895 900 gtg ctc cga cca ctc aga gcc atc aac aga gcc aag ggg ttg aag cac 2973 Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 905 910 915 gtg gct agg tgc atg ttc gtg gcc atc agc acc atc ggg aac atc gtg 3021 Val Ala Arg Cys Met Phe Val Ala Ile Ser Thr Ile Gly Asn Ile Val 920 925 930 ctg gtc act acc ctc cta cag ttc atg ttt gcc tgc atc ggc gtc cag 3069 Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly Val Gln 935 940 945 ctc ttc aag ggg aag ttc ttc agg tgc acc gac ttg tcc aag atg aca 3117 Leu Phe Lys Gly Lys Phe Phe Arg Cys Thr Asp Leu Ser Lys Met Thr 950 955 960 gag gag gag tgc agg ggc tac tac tac gtg tac aag gac ggg gac ccc 3165 Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys Asp Gly Asp Pro 965 970 975 980 atg cag ata gag ctg cgt cac cgc gag tgg gta cac agc gac ttc cac 3213 Met Gln Ile Glu Leu Arg His Arg Glu Trp Val His Ser Asp Phe His 985 990 995 ttc gac aat gtg ctc tca gcc atg atg tcc ctc ttc acg gtc tcc 3258 Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe Thr Val Ser 1000 1005 1010 acc ttc gag gga tgg cct cag ctg ctg tac aag gcc ata gac tcc 3303 Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Lys Ala Ile Asp Ser 1015 1020 1025 aat gcg gag gac gtg ggt ccc atc tac aac aac cgt gtg gag atg 3348 Asn Ala Glu Asp Val Gly Pro Ile Tyr Asn Asn Arg Val Glu Met 1030 1035 1040 gcc atc ttc ttc atc atc tac atc atc ctc att gcc ttc ttc atg 3393 Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala Phe Phe Met 1045 1050 1055 atg aac atc ttt gtg ggc ttc gtc att gtc acc ttc cag gag cag 3438 Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe Gln Glu Gln 1060 1065 1070 gga gag act gag tac aag aac tgt gag ctg gac aag aac cag cgc 3483 Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg 1075 1080 1085 caa tgt gta cag tat gcc ctg aag gcc cgc cca ctg agg tgc tac 3528 Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu Arg Cys Tyr 1090 1095 1100 att ccc aaa aac cca tac cag tac cag gtg tgg tac att gtc acc 3573 Ile Pro Lys 
Asn Pro Tyr Gln Tyr Gln Val Trp Tyr Ile Val Thr 1105 1110 1115 tcc tcc tac ttt gaa tac ctg atg ttt gcc ctc atc atg ctc aac 3618 Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile Met Leu Asn 1120 1125 1130 acc atc tgc ctc ggc atg cag cac tac aac cag tcg gag cag atg 3663 Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser Glu Gln Met 1135 1140 1145 aac cac atc tca gac atc ctc aat gtg gcc ttc act atc atc ttc 3708 Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr Ile Ile Phe 1150 1155 1160 acc ctg gag atg atc ctc aag ctc atg gcc ttc aag gcc agg ggc 3753 Thr Leu Glu Met Ile Leu Lys Leu Met Ala Phe Lys Ala Arg Gly 1165 1170 1175 tac ttt gga aac ccc tgg aat gtg ttt gac ttc ctg att gtc att 3798 Tyr Phe Gly Asn Pro Trp Asn Val Phe Asp Phe Leu Ile Val Ile 1180 1185 1190 ggc agc atc att gat gtc atc ctc agt gag atc gac act ttc ctg 3843 Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp Thr Phe Leu 1195 1200 1205 gcc tcc agc ggg gga ctg tat tgc ctg ggt gga ggc tgc ggg aac 3888 Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly Cys Gly Asn 1210 1215 1220 gtt gac cca gat gag agt gcc cgc atc tcc agc gcc ttc ttc cgc 3933 Val Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala Phe Phe Arg 1225 1230 1235 ctg ttc cgt gtc atg agg ctg atc aag ctg ctg agc cgg gca gaa 3978 Leu Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser Arg Ala Glu 1240 1245 1250 gga gtg cga acc ctc ctg tgg acg ttc atc aag tcc ttc cag gcc 4023 Gly Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala 1255 1260 1265 cta ccc tac gtg gct ctg ctc atc gtc atg ctc ttc ttc atc tac 4068 Leu Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe Phe Ile Tyr 1270 1275 1280 gct gtc atc ggc atg cag atg ttt ggg aag atc gcc ttg gtg gat 4113 Ala Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala Leu Val Asp 1285 1290 1295 ggg acc caa ata aac cgg aac aac aac ttc cag acc ttc cca caa 4158 Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln 1300 1305 1310 gct gtg cta ctg ctc ttc agg tgt gca aca ggt gag gcc tgg cag 4203 Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala 
Trp Gln 1315 1320 1325 gag atc cta ctg gcc tgc agc tat ggg aag ctg tgt gac cca gag 4248 Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp Pro Glu 1330 1335 1340 tcg gac tat gcc cca ggg gag gag tac aca tgt ggc acc aac ttt 4293 Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn Phe 1345 1350 1355 gca tac tac tac ttc atc agc ttc tac atg ctc tgt gcc ttc ctg 4338 Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu 1360 1365 1370 gtc atc aac ctc ttt gtg gct gtc atc atg gac aat ttt gac tac 4383 Val Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr 1375 1380 1385 ctc acc cgg gac tgg tcc atc ctg ggc cct cat cac ctg gat gag 4428 Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu 1390 1395 1400 ttc aag gcc atc tgg gca gag tat gac cca gag gct aag ggg agg 4473 Phe Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg 1405 1410 1415 atc aaa cac ctg gac gtg gtg acc ctg ctg aga agg att cag ccc 4518 Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro 1420 1425 1430 cct ctg ggc ttt ggg aag ttc tgc cca cat cgg gta gct tgt aag 4563 Pro Leu Gly Phe Gly Lys Phe Cys Pro His Arg Val Ala Cys Lys 1435 1440 1445 cgg ctg gtg ggc atg aac atg ccc ctg aac agc gac ggc aca gtc 4608 Arg Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val 1450 1455 1460 acc ttc aat gcc aca ctc ttt gcc ctg gtc cgc acg gca ctc aag 4653 Thr Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys 1465 1470 1475 atc aag acg gaa ggt aac ttt gag cag gcc aac gag gag ctg agg 4698 Ile Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu Glu Leu Arg 1480 1485 1490 gcc atc atc aag aag atc tgg aag aga acc agc atg aag ctc ttg 4743 Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met Lys Leu Leu 1495 1500 1505 gac cag gtc atc cct cca ata gga gat gat gag gtg aca gtg ggg 4788 Asp Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val Thr Val Gly 1510 1515 1520 aag ttc tac gcc aca ttc ctc atc cag gag cac ttc cgg aag ttc 4833 Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe Arg Lys Phe 1525 1530 1535 atg aaa cgc caa 
gag gag tat tat ggc tat cgg ccc aag aag gac 4878 Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro Lys Lys Asp 1540 1545 1550 att gta cag atc cag gca ggg ctg cgg acc att gag gaa gag gca 4923 Ile Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu Glu Glu Ala 1555 1560 1565 gcc ccc gag atc tgt cgc acg gtc tca gga gac ctg gct gct gag 4968 Ala Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu Ala Ala Glu 1570 1575 1580 gag gag ctg gag aga gcc atg gtg gag gct gcg atg gag gag ggg 5013 Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu Gly 1585 1590 1595 ata ttc cgg agg act gga ggc ctg ttt ggc cag gtg gac aac ttc 5058 Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Asn Phe 1600 1605 1610 ctg gaa agg acc aac tcc ctg ccc cct gtc atg gcc aat cag aga 5103 Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg 1615 1620 1625 ccc ctc cag ttt gct gag ata gag atg gaa gag atg gag tca cct 5148 Pro Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met Glu Ser Pro 1630 1635 1640 gtc ttc ttg gag gac ttc cca caa gat cca cgc acc aac ccc ctg 5193 Val Phe Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr Asn Pro Leu 1645 1650 1655 gct cgt gcc aat acc aac aat gcc aac gcc aat gtc gcc tat gcg 5238 Ala Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Ala 1660 1665 1670 aac agc aac cat agc aac agc cat gtg ttt tcc agt gtc cac tat 5283 Asn Ser Asn His Ser Asn Ser His Val Phe Ser Ser Val His Tyr 1675 1680 1685 gaa agg gag ttc cca gaa gag aca gag acg cct gct acc aga gga 5328 Glu Arg Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala Thr Arg Gly 1690 1695 1700 cga gcc ctt ggc caa ccc tgc agg tcc ctg gga ccc cac agc aaa 5373 Arg Ala Leu Gly Gln Pro Cys Arg Ser Leu Gly Pro His Ser Lys 1705 1710 1715 ccc tgt gtg gag atg ctg aag gga ctg ctg acc cag agg gca atg 5418 Pro Cys Val Glu Met Leu Lys Gly Leu Leu Thr Gln Arg Ala Met 1720 1725 1730 ccc aga ggc cag gca cct cct gcc ccc tgc cag tgc ccc agg gtg 5463 Pro Arg Gly Gln Ala Pro Pro Ala Pro Cys Gln Cys Pro Arg Val 1735 1740 1745 gag tcc tcc atg cct gag gac aga aag agc tcc aca cca ggg 
tct 5508 Glu Ser Ser Met Pro Glu Asp Arg Lys Ser Ser Thr Pro Gly Ser 1750 1755 1760 ctt cat gag gag aca ccc cac agc agg agc acc agg gag aat act 5553 Leu His Glu Glu Thr Pro His Ser Arg Ser Thr Arg Glu Asn Thr 1765 1770 1775 tcc agg tgc tca gca cca gct aca gcc ctg ctg atc caa aag gct 5598 Ser Arg Cys Ser Ala Pro Ala Thr Ala Leu Leu Ile Gln Lys Ala 1780 1785 1790 ctg gtt cga ggg ggc ctg ggc acc ttg gca gct gat gca aac ttc 5643 Leu Val Arg Gly Gly Leu Gly Thr Leu Ala Ala Asp Ala Asn Phe 1795 1800 1805 atc atg gca aca ggc cag gcc ctc gga gat gcc tgc caa atg gaa 5688 Ile Met Ala Thr Gly Gln Ala Leu Gly Asp Ala Cys Gln Met Glu 1810 1815 1820 cca gag gaa gtg gag atc atg gca aca gag cta ctg aaa gga cga 5733 Pro Glu Glu Val Glu Ile Met Ala Thr Glu Leu Leu Lys Gly Arg 1825 1830 1835 gag gcc cca gac ggc atg gcc agc tcc ctg gga tgc ctg aac ctc 5778 Glu Ala Pro Asp Gly Met Ala Ser Ser Leu Gly Cys Leu Asn Leu 1840 1845 1850 ggg tcc tcc ctg ggc agc ctc gac caa cac cag ggc tcc cag gag 5823 Gly Ser Ser Leu Gly Ser Leu Asp Gln His Gln Gly Ser Gln Glu 1855 1860 1865 acc ctt att cct cca agg ctg tga tgcccacaca gcatcagcat gggcttagag 5877 Thr Leu Ile Pro Pro Arg Leu 1870 ctggcatgac caatgggggt ggggaagttg ctggggtgga gaagggctag cccaccgcag 5937 cagcctccct ccctctcagc agctagatgc atgcctgagg cagggtggtc aggaaccacc 5997 tcaaaaagtg cggaggaagt agctggacag gccctgcccc tcaccagcaa gaggcatgat 6057 tggatggagc ttctaatgtc attcaaaaag gcctggtcag tgcctgtccc tagggccact 6117 cccacctgca ggacattaaa atctccaggc ctgtgacact ggc 6160 14 1873 PRT Homo sapiens 14 Met Glu Pro Ser Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro 1 5 10 15 Lys Lys Pro Val Pro Glu Ile Leu Pro Arg Pro Pro Arg Ala Leu Phe 20 25 30 Cys Leu Thr Leu Glu Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val 35 40 45 Glu Trp Lys Pro Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn 50 55 60 Cys Val Ala Leu Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn 65 70 75 80 Ser Leu Asn Leu Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Ile Val 85 90 95 Phe Ser Ile Glu Ala Ala 
Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe 100 105 110 His Gln Asp Ala Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Thr 115 120 125 Ile Val Phe Leu Gly Val Phe Thr Val Ile Leu Glu Gln Val Asn Val 130 135 140 Ile Gln Ser His Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp 145 150 155 160 Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val 165 170 175 Ser Gly Val Pro Ser Leu Gln Val Val Leu Asn Ser Ile Phe Lys Ala 180 185 190 Met Leu Pro Leu Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile 195 200 205 Ile Tyr Ala Ile Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys 210 215 220 Thr Cys Tyr Phe Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu 225 230 235 240 Glu Pro Ser Pro Cys Ala Arg Thr Gly Ser Gly Arg Arg Cys Thr Ile 245 250 255 Asn Gly Ser Glu Cys Arg Gly Gly Cys Pro Gly Pro Asn His Gly Ile 260 265 270 Thr His Phe Asp Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys 275 280 285 Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala 290 295 300 Ile Gly Asn Glu Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu 305 310 315 320 Gly Ser Phe Phe Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu 325 330 335 Phe Thr Lys Glu Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys 340 345 350 Leu Arg Glu Lys Gln Gln Leu Asp Glu Asp Leu Arg Gly Tyr Met Ser 355 360 365 Trp Ile Thr Gln Gly Glu Val Met Asp Val Glu Asp Phe Arg Glu Gly 370 375 380 Lys Leu Ser Leu Asp Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu 385 390 395 400 Ile Ala Gly Leu Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln 405 410 415 Trp Asn Arg Ile Phe Arg Trp Lys Cys His Asp Ile Val Lys Ser Lys 420 425 430 Val Phe Tyr Trp Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser 435 440 445 Ile Ala Ser Glu His His Asn Gln Pro His Trp Leu Thr Arg Leu Gln 450 455 460 Asp Ile Ala Asn Arg Val Leu Leu Ser Leu Phe Thr Thr Glu Met Leu 465 470 475 480 Met Lys Met Tyr Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe 485 490 495 Asn Arg Phe Asp Cys Phe Val Val Cys Ser Gly Ile Leu Glu Ile Leu 500 505 510 Leu Val Glu Ser Gly Ala Met 
Thr Pro Leu Gly Ile Ser Val Leu Arg 515 520 525 Cys Ile Arg Leu Leu Arg Ile Phe Lys Ile Thr Lys Tyr Trp Thr Ser 530 535 540 Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala 545 550 555 560 Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Val Ile Phe Arg Leu Leu 565 570 575 Gly Met Gln Leu Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val 580 585 590 Arg Arg Ser Asn Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe 595 600 605 Gln Val Leu Thr Gly Glu Asp Trp Thr Ser Met Met Tyr Asn Gly Ile 610 615 620 Met Ala Ser Ser Gly Pro Ser Tyr Pro Gly Met Leu Val Cys Ile Tyr 625 630 635 640 Phe Ile Ile Leu Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe 645 650 655 Leu Ala Ile Ala Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser 660 665 670 Ala Gln Lys Ala Lys Ala Glu Glu Lys Lys Arg Arg Lys Met Ser Lys 675 680 685 Gly Leu Pro Asp Lys Ser Glu Glu Glu Lys Ser Thr Met Ala Lys Lys 690 695 700 Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu 705 710 715 720 Lys Ile Asp Glu Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr 725 730 735 Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile 740 745 750 Pro Leu Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu 755 760 765 Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro 770 775 780 Thr Asn Lys Ile Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp 785 790 795 800 Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu 805 810 815 Ala Ala Glu Asp Pro Ile Arg Ala Asp Ser Met Arg Asn Gln Ile Leu 820 825 830 Lys His Phe Asp Ile Gly Phe Thr Ser Val Phe Thr Val Glu Ile Val 835 840 845 Leu Lys Met Thr Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys 850 855 860 Arg Asn Tyr Phe Asn Met Leu Asp Leu Leu Val Val Ala Val Ser Leu 865 870 875 880 Ile Ser Met Gly Leu Glu Ser Ser Ala Ile Ser Val Val Lys Ile Leu 885 890 895 Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys 900 905 910 Gly Leu Lys His Val Ala Arg Cys Met Phe Val Ala Ile Ser Thr Ile 915 920 925 Gly Asn Ile Val Leu Val Thr Thr 
Leu Leu Gln Phe Met Phe Ala Cys 930 935 940 Ile Gly Val Gln Leu Phe Lys Gly Lys Phe Phe Arg Cys Thr Asp Leu 945 950 955 960 Ser Lys Met Thr Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys 965 970 975 Asp Gly Asp Pro Met Gln Ile Glu Leu Arg His Arg Glu Trp Val His 980 985 990 Ser Asp Phe His Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe 995 1000 1005 Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Lys Ala 1010 1015 1020 Ile Asp Ser Asn Ala Glu Asp Val Gly Pro Ile Tyr Asn Asn Arg 1025 1030 1035 Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala 1040 1045 1050 Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe 1055 1060 1065 Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 1070 1075 1080 Asn Gln Arg Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu 1085 1090 1095 Arg Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr 1100 1105 1110 Ile Val Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile 1115 1120 1125 Met Leu Asn Thr Ile Cys Leu Gly Met Gln His Tyr Asn Gln Ser 1130 1135 1140 Glu Gln Met Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr 1145 1150 1155 Ile Ile Phe Thr Leu Glu Met Ile Leu Lys Leu Met Ala Phe Lys 1160 1165 1170 Ala Arg Gly Tyr Phe Gly Asn Pro Trp Asn Val Phe Asp Phe Leu 1175 1180 1185 Ile Val Ile Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp 1190 1195 1200 Thr Phe Leu Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly 1205 1210 1215 Cys Gly Asn Val Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala 1220 1225 1230 Phe Phe Arg Leu Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser 1235 1240 1245 Arg Ala Glu Gly Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser 1250 1255 1260 Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe 1265 1270 1275 Phe Ile Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala 1280 1285 1290 Leu Val Asp Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr 1295 1300 1305 Phe Pro Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu 1310 1315 1320 Ala Trp Gln Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys 1325 1330 
1335 Asp Pro Glu Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly 1340 1345 1350 Thr Asn Phe Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys 1355 1360 1365 Ala Phe Leu Val Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn 1370 1375 1380 Phe Asp Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His 1385 1390 1395 Leu Asp Glu Phe Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala 1400 1405 1410 Lys Gly Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg 1415 1420 1425 Ile Gln Pro Pro Leu Gly Phe Gly Lys Phe Cys Pro His Arg Val 1430 1435 1440 Ala Cys Lys Arg Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp 1445 1450 1455 Gly Thr Val Thr Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr 1460 1465 1470 Ala Leu Lys Ile Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu 1475 1480 1485 Glu Leu Arg Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met 1490 1495 1500 Lys Leu Leu Asp Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val 1505 1510 1515 Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe 1520 1525 1530 Arg Lys Phe Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro 1535 1540 1545 Lys Lys Asp Ile Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu 1550 1555 1560 Glu Glu Ala Ala Pro Glu Ile Cys Arg Thr Val Ser Gly Asp Leu 1565 1570 1575 Ala Ala Glu Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met 1580 1585 1590 Glu Glu Gly Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val 1595 1600 1605 Asp Asn Phe Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala 1610 1615 1620 Asn Gln Arg Pro Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Met 1625 1630 1635 Glu Ser Pro Val Phe Leu Glu Asp Phe Pro Gln Asp Pro Arg Thr 1640 1645 1650 Asn Pro Leu Ala Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val 1655 1660 1665 Ala Tyr Ala Asn Ser Asn His Ser Asn Ser His Val Phe Ser Ser 1670 1675 1680 Val His Tyr Glu Arg Glu Phe Pro Glu Glu Thr Glu Thr Pro Ala 1685 1690 1695 Thr Arg Gly Arg Ala Leu Gly Gln Pro Cys Arg Ser Leu Gly Pro 1700 1705 1710 His Ser Lys Pro Cys Val Glu Met Leu Lys Gly Leu Leu Thr Gln 1715 1720 1725 Arg Ala Met Pro Arg Gly Gln Ala Pro Pro Ala 
Pro Cys Gln Cys 1730 1735 1740 Pro Arg Val Glu Ser Ser Met Pro Glu Asp Arg Lys Ser Ser Thr 1745 1750 1755 Pro Gly Ser Leu His Glu Glu Thr Pro His Ser Arg Ser Thr Arg 1760 1765 1770 Glu Asn Thr Ser Arg Cys Ser Ala Pro Ala Thr Ala Leu Leu Ile 1775 1780 1785 Gln Lys Ala Leu Val Arg Gly Gly Leu Gly Thr Leu Ala Ala Asp 1790 1795 1800 Ala Asn Phe Ile Met Ala Thr Gly Gln Ala Leu Gly Asp Ala Cys 1805 1810 1815 Gln Met Glu Pro Glu Glu Val Glu Ile Met Ala Thr Glu Leu Leu 1820 1825 1830 Lys Gly Arg Glu Ala Pro Asp Gly Met Ala Ser Ser Leu Gly Cys 1835 1840 1845 Leu Asn Leu Gly Ser Ser Leu Gly Ser Leu Asp Gln His Gln Gly 1850 1855 1860 Ser Gln Glu Thr Leu Ile Pro Pro Arg Leu 1865 1870 15 7362 DNA Homo sapiens gene (1)..(7362) n is dATP, dCTP, dGTP, or dTTP 15 agaataaggg cagggaccgc ggctcctacc tcttggtgat ccccttcccc attccgcccc 60 cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtggat 120 gatgatgatg atgatgatga aaaaaatgca gcatcaacgg cagcagcaag cggaccacgc 180 gaacgaggca aactatgcaa gaggcaccag acttcctctt tctggtgaag gaccaacttc 240 tcagccgaat agctccaagc aaactgtcct gtcttggcaa gctgcaatcg atgctgctag 300 acaggccaag gctgcccaaa ctatgagcac ctctgcaccc ccacctgtag gatctctctc 360 ccaaagaaaa cgtcagcaat acgccaagag caaaaaacag ggtaactcgt ccaacagccg 420 acctgcccgc gcccttttct gtttatcact caataacccc atccgaagag cctgcattag 480 tatagtggaa tggaaaccat ttgacatatt tatattattg gctatttttg ccaattgtgt 540 ggccttagct atttacatcc cattccctga agatgattct aattcaacaa atcataactt 600 ggaaaaagta gaatatgcct tcctgattat ttttacagtc gagacatttt tgaagattat 660 agcgtatgga ttattgctac atcctaatgc ttatgttagg aatggatgga atttactgga 720 ttttgttata gtaatagtag gattgtttag tgtaattttg gaacaattaa ccaaagaaac 780 agaaggcggg aaccactcaa gcggcaaatc tggaggcttt gatgtcaaag ccctccgtgc 840 ctttcgagtg ttgcgaccac ttcgactagt gtcaggagtg cccagtttac aagttgtcct 900 gaactccatt ataaaagcca tggttcccct ccttcacata gcccttttgg tattatttgt 960 aatcataatc tatgctatta taggattgga actttttatt ggaaaaatgc acaaaacatg 1020 tttttttgct gactcagata tcgtagctga agaggaccca 
gctccatgtg cgttctcagg 1080 gaatggacgc cagtgtactg ccaatggcac ggaatgtagg agtggctggg ttggcccgaa 1140 cggaggcatc accaactttg ataactttgc ctttgccatg cttactgtgt ttcagtgcat 1200 caccatggag ggctggacag atgtgctcta ctgggtaaat gatgcgatag gatgggaatg 1260 gccatgggtg tattttgtta gtctgatcat ccttggctca tttttcgtcc ttaacctggc 1320 tcatttttcg tccttatcag agaattctca aaggaaagag agaaggcaaa agcacgggga 1380 gatttccaga agctccggga gaagcagcag ctggaggagg atctaaaggg ctacttggat 1440 tggatcaccc aagctgagga catcgatccg gagaatgagg aagaaggagg agaggaaggc 1500 aaacgaaata ctagc atg ccc acc agc gag act gag tct gtg aac aca gag 1551 Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu 1 5 10 aac gtc agc ggt gaa ggc gag aac cga ggc tgc tgt gga agt ctc tgg 1599 Asn Val Ser Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp 15 20 25 tgc tgg tgg aga cgg aga ggc gcg gcc aag gcg ggg ccc tct ggg tgt 1647 Cys Trp Trp Arg Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys 30 35 40 cgg cgg tgg ggt caa gcc atc tca aaa tcc aaa ctc agc cga cgc tgg 1695 Arg Arg Trp Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp 45 50 55 60 cgt cgc tgg aac cga ttc aat cgc aga aga tgt agg gcc gcc gtg aag 1743 Arg Arg Trp Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys 65 70 75 tct gtc acg ttt tac tgg ctg gtt atc gtc ctg gtg ttt ctg aac acc 1791 Ser Val Thr Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr 80 85 90 tta acc att tcc tct gag cac tac aat cag cca gat tgg ttg aca cag 1839 Leu Thr Ile Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln 95 100 105 att caa gat atg ccc aac aaa gtc ctc ttg gct ctg ttc acc tgc gag 1887 Ile Gln Asp Met Pro Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu 110 115 120 atg ctg gta aaa atg tac agc ttg ggc ctc caa gca tat ttc gtc tct 1935 Met Leu Val Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser 125 130 135 140 ctt ttc aac cgg ttt gat tgc ttc gtg gtg tgt ggt gga atc act gag 1983 Leu Phe Asn Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu 145 150 155 acg atc ttg gtg gaa ctg gaa atc atg tct ccc ctg ggg 
atc tct gtg 2031 Thr Ile Leu Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val 160 165 170 ttt cgg tgt gtg cgc ctc tta aga atc ttc aaa gtg acc agg cac tgg 2079 Phe Arg Cys Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp 175 180 185 act tcc ctg agc aac tta gtg gca tcc tta tta aac tcc atg aag tcc 2127 Thr Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser 190 195 200 atc gct tcg ctg ttg ctt ctg ctt ttt ctc ttc att atc atc ttt tcc 2175 Ile Ala Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser 205 210 215 220 ttg ctt ggg atg cag ctg ttt ggc ggc aag ttt aat ttt gat gaa acg 2223 Leu Leu Gly Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr 225 230 235 caa acc aag cgg agc acc ttt gac aat ttc cct caa gca ctt ctc aca 2271 Gln Thr Lys Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr 240 245 250 gtg ttc cag atc ctg aca ggc gaa gac tgg aat gct gtg atg tac gat 2319 Val Phe Gln Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp 255 260 265 ggc atc atg gct tac ggg ggc cca tcc tct tca gga atg atc gtc tgc 2367 Gly Ile Met Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys 270 275 280 atc tac ttc atc atc ctc ttc att tgt ggt aac tat att cta ctg aat 2415 Ile Tyr Phe Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn 285 290 295 300 gtc ttc ttg gcc atc gct gta gac aat ttg gct gat gct gaa agt ctg 2463 Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu 305 310 315 aac act gct cag aaa gaa gaa gcg gaa gaa aag gag agg aaa aag att 2511 Asn Thr Ala Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile 320 325 330 gcc aga aaa gag agc cta gaa aat aaa aag aac aac aaa cca gaa gtc 2559 Ala Arg Lys Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val 335 340 345 aac cag ata gcc aac agt gac aac aag gtt aca att gat gac tat aga 2607 Asn Gln Ile Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg 350 355 360 gaa gag gat gaa gac aag gac ccc tat ccg cct tgc gat gtg cca ggt 2655 Glu Glu Asp Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Gly 365 370 375 380 atg gtg 
gnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 2703 Met Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 385 390 395 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 2751 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 400 405 410 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 2799 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 415 420 425 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 2847 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 430 435 440 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 2895 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 445 450 455 460 nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 2943 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 465 470 475 nnn nnn nnn nnn nnn nnn ccc tgc agg atc cgc gta ggc tgc cac aag 2991 Xaa Xaa Xaa Xaa Xaa Xaa Pro Cys Arg Ile Arg Val Gly Cys His Lys 480 485 490 ctc atc aac cac cac atc ttc acc aac ctc atc ctt gtc ttc atc atg 3039 Leu Ile Asn His His Ile Phe Thr Asn Leu Ile Leu Val Phe Ile Met 495 500 505 ctg agc agc gct gcc ctg gcc gca gag gac ccc atc cgc agc cac tcc 3087 Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro Ile Arg Ser His Ser 510 515 520 ttc cgg aac acg ata ctg ggt tac ttt gac tat gcc ttc aca gcc atc 3135 Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr Ala Phe Thr Ala Ile 525 530 535 540 ttt act gtt gag atc ctg ttg aag atg aca act ttt gga gct ttc ctc 3183 Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr Phe Gly Ala Phe Leu 545 550 555 cac aaa ggg gcc ttc tgc agg aac tac ttc aat ttg ctg gat atg ctg 3231 His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn Leu Leu Asp Met Leu 560 565 570 gtg gtt ggg gtg tct ctg gtg tca ttt ggg att caa tcc agt gcc atc 3279 Val Val Gly Val Ser Leu Val Ser Phe Gly Ile Gln Ser Ser Ala Ile 575 580 585 tcc gtt gtg aag att ctg agg gtc tta agg gtc ctg cgt ccc ctc agg 3327 Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 
Leu Arg Pro Leu Arg 590 595 600 gcc atc aac aga gca aaa gga ctt aag cac gtg gtc cag tgc gtc ttc 3375 Ala Ile Asn Arg Ala Lys Gly Leu Lys His Val Val Gln Cys Val Phe 605 610 615 620 gtg gcc atc cgg acc atc ggc aac atc atg atc gtc acc acc ctc ctg 3423 Val Ala Ile Arg Thr Ile Gly Asn Ile Met Ile Val Thr Thr Leu Leu 625 630 635 cag ttc atg ttt gcc tgt atc ggg gtc cag ttg ttc aag ggg aag ttc 3471 Gln Phe Met Phe Ala Cys Ile Gly Val Gln Leu Phe Lys Gly Lys Phe 640 645 650 tat cgc tgt acg gat gaa gcc aaa agt aac cct gaa gaa tgc agg gga 3519 Tyr Arg Cys Thr Asp Glu Ala Lys Ser Asn Pro Glu Glu Cys Arg Gly 655 660 665 ctt ttc atc ctc tac aag gat ggg gat gtt gac agt cct gtg gtc cgt 3567 Leu Phe Ile Leu Tyr Lys Asp Gly Asp Val Asp Ser Pro Val Val Arg 670 675 680 gaa cgg atc tgg caa aac agt gat ttc aac ttc gac aac gtc ctc tct 3615 Glu Arg Ile Trp Gln Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser 685 690 695 700 gct atg atg gcg ctc ttc aca gtc tcc acg ttt gag ggc tgg cct gcg 3663 Ala Met Met Ala Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala 705 710 715 ttg ctg tat aaa gcc atc gac tcg aat gga gag aac atc ggc cca atc 3711 Leu Leu Tyr Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile 720 725 730 tac aac cac cgc gtg gag atc tcc atc ttc ttc atc atc tac atc atc 3759 Tyr Asn His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile 735 740 745 att gta gct ttc ttc atg atg aac atc ttt gtg ggc ttt gtc atc gtt 3807 Ile Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 750 755 760 aca ttt cag gaa caa gga gaa aaa gag tat aag aac tgt gag ctg gac 3855 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu Asp 765 770 775 780 aaa aat cag cgt cag tgt gtt gaa tac gcc ttg aaa gca cgt ccc ttg 3903 Lys Asn Gln Arg Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg Pro Leu 785 790 795 cgg aga tac atc ccc aaa aac ccc tac cag tac aag ttc tgg tac gtg 3951 Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe Trp Tyr Val 800 805 810 gtg aac tct tcg cct ttc gaa tac atg atg ttt gtc ctc atc atg ctc 3999 
Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val Leu Ile Met Leu 815 820 825 aac aca ctc tgc ttg gcc atg cag cac tac gag cag tcc aag atg ttc 4047 Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu Gln Ser Lys Met Phe 830 835 840 aat gat gcc atg gac att ctg aac atg gtc ttc acc ggg gtg ttc acc 4095 Asn Asp Ala Met Asp Ile Leu Asn Met Val Phe Thr Gly Val Phe Thr 845 850 855 860 gtc gag atg gtt ttg aaa gtc atc gca ttt aag cct aag ggg tat ttt 4143 Val Glu Met Val Leu Lys Val Ile Ala Phe Lys Pro Lys Gly Tyr Phe 865 870 875 agt gac gcc tgg aac acg ttt gac tcc ctc atc gta atc ggc agc att 4191 Ser Asp Ala Trp Asn Thr Phe Asp Ser Leu Ile Val Ile Gly Ser Ile 880 885 890 ata gac gtg gcc ctc agc gaa gca aag cca act gaa agt gaa aat gtc 4239 Ile Asp Val Ala Leu Ser Glu Ala Lys Pro Thr Glu Ser Glu Asn Val 895 900 905 cct gtc cca act gct aca cct ggg aac tct gaa gag agc aat aga atc 4287 Pro Val Pro Thr Ala Thr Pro Gly Asn Ser Glu Glu Ser Asn Arg Ile 910 915 920 tcc atc acc ttt ttc cgt ctt ttc cga gtg atg cga ttg gtg aag ctt 4335 Ser Ile Thr Phe Phe Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu 925 930 935 940 ctc agc agg ggg gaa ggc atc cgg aca ttg ctg tgg act ttt att aag 4383 Leu Ser Arg Gly Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys 945 950 955 tcc ttt cag gcg ctc ccg tat gtg gcc ctc ctc ata gcc atg ctg ttc 4431 Ser Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe 960 965 970 ttc atc tat gcg gtc att ggc atg cag atg ttt ggg aaa gtt gcc atg 4479 Phe Ile Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met 975 980 985 aga gat aac aac cag atc aat agg aac aat aac ttc cag acg ttt ccc 4527 Arg Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 990 995 1000 cag gcg gtg ctg ctg ctc ttc agg tgt gca aca ggt gag gcc tgg 4572 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1005 1010 1015 cag gag atc atg ctg gcc tgt ctc cca ggg aag ctc tgt gac cct 4617 Gln Glu Ile Met Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro 1020 1025 1030 gag tca gat tac aac ccc ggg gag gag tat 
aca tgt ggg agc aac 4662 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1035 1040 1045 ttt gcc att gtc tat ttc atc agt ttt tac atg ctc tgt gca ttt 4707 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1050 1055 1060 ctg atc atc aat ctg ttt gtg gct gtc atc atg gat aat ttc gac 4752 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1065 1070 1075 tat ctg acc cgg gac tgg tct att ttg ggg cct cac cat tta gat 4797 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1080 1085 1090 gaa ttc aaa aga ata tgg tca gaa tat gac cct gag gca aag gga 4842 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1095 1100 1105 agg ata aaa cac ctt gat gtg gtc act ctg ctt cga cgc atc cag 4887 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1110 1115 1120 cct ccc ctg ggg ttt ggg aag tta tgt cca cac agg gta gcg tgc 4932 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1125 1130 1135 aag aga tta gtt gcc atg aac atg cct ctc aac agt gac ggg aca 4977 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1140 1145 1150 gtc atg ttt aat gca acc ctg ttt gct ttg gtt cga acg gct ctt 5022 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1155 1160 1165 aag atc aag acc gaa ggg aac ctg gag caa gct aat gaa gaa ctt 5067 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1170 1175 1180 cgg gct gtg ata aag aaa att tgg aag aaa acc agc atg aaa tta 5112 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1185 1190 1195 ctt gac caa gtt gtc cct cca gct ggt gat gat gag gta acc gtg 5157 Leu Asp Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1200 1205 1210 ggg aag ttc tat gcc act ttc ctg ata cag gac tac ttt agg aaa 5202 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1215 1220 1225 ttc aag aaa cgg aaa gaa caa gga ctg gtg gga aag tac cct gcg 5247 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 1230 1235 1240 aag aac acc aca att gcc cta cag gcg gga tta agg aca ctg cat 5292 Lys Asn Thr 
Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1245 1250 1255 gac att ggg cca gaa atc cgg cgt gct ata tcg tgt gat ttg caa 5337 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1260 1265 1270 gat gac gag cct gag gaa aca aaa cga gaa gaa gaa gat gat gtg 5382 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1275 1280 1285 ttc aaa aga aat ggt gcc ctg ctt gga aac cat gtc aat cat gtt 5427 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1290 1295 1300 aat agt gat agg aga gat tcc ctt cag cag acc aat acc acc cac 5472 Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His 1305 1310 1315 cgt ccc ctg cat gtc caa agg cct tca att cca cct gca agt gat 5517 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1320 1325 1330 act gag aaa ccg ctg ttt cct cca gca gga aat tcg gtg tgt cat 5562 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1335 1340 1345 aac cat cat aac cat aat tcc ata gga aag caa gtt ccc acc tca 5607 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1350 1355 1360 aca aat gcc aat ctc aat aat gcc aat atg tcc aaa gct gcc cat 5652 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1365 1370 1375 gga aag cgg ccc agc att ggg aac ctt gag cat gtg tct gaa aat 5697 Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1380 1385 1390 ggg cat cat tct tcc cac aag cat gac cgg gag cct cag aga agg 5742 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1395 1400 1405 tcc agt gtg aaa aga acc cgc tat tat gaa act tac att agg tcc 5787 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1410 1415 1420 gac tca gga gat gaa cag ctc cca act att tgc cgg gaa gac cca 5832 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1425 1430 1435 gag ata cat ggc tat ttc agg gac ccc cac tgc ttg ggg gag cag 5877 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1440 1445 1450 gag tat ttc agt agt gag gaa tgc tac gag gat gac agc tcg ccc 5922 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser 
Ser Pro 1455 1460 1465 acc tgg agc agg caa aac tat ggc tac tac agc aga tac cca ggc 5967 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1470 1475 1480 aga aac atc gac tct gag agg ccc cga ggc tac cat cat ccc caa 6012 Arg Asn Ile Asp Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1485 1490 1495 gga ttc ttg gag gac gat gac tcg ccc gtt tgc tat gat tca cgg 6057 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1500 1505 1510 aga tct cca agg aga cgc cta cta cct ccc acc cca gca tcc cac 6102 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1515 1520 1525 cgg aga tcc tcc ttc aac ttt gag tgc ctg cgc cgg cag agc agc 6147 Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1530 1535 1540 cag gaa gag gtc ccg tcg tct ccc atc ttc ccc cat cgc acg gcc 6192 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1545 1550 1555 ctg cct ctg cat cta atg cag caa cag atc atg gca gtt gcc ggc 6237 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1560 1565 1570 cta gat tca agt aaa gcc cag aag tac tca ccg agt cac tcg acc 6282 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1575 1580 1585 cgg tcg tgg gcc acc cct cca gca acc cct ccc tac cgg gac tgg 6327 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp 1590 1595 1600 aca ccg tgc tac acc ccc ctg atc caa gtg gag cag tca gag gcc 6372 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 1605 1610 1615 ctg gac cag gtg aac ggc agc ctg ccg tcc ctg cac cgc agc tcc 6417 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 1620 1625 1630 tgg tac aca gac gag ccc gac atc tcc tac cgg act ttc aca cca 6462 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 1635 1640 1645 gcc agc ctg act gtc ccc agc agc ttc cgg aac aaa aac agc gac 6507 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 1650 1655 1660 aag cag agg agt gcg gac agc ttg gtg gag gca gtc ctg ata tcc 6552 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 1665 1670 1675 gaa ggc ttg gga 
cgc tat gca agg gac cca aaa ttt gtg tca gca 6597 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 1680 1685 1690 aca aaa cac gaa atc gct gat gcc tgt gac ctc acc atc gac gag 6642 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu 1695 1700 1705 atg gag agt gca gcc agc acc ctg ctt aat ggg aac gtg cgt ccc 6687 Met Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 1710 1715 1720 cga gcc aac ggg gat gtg ggc ccc ctc tca cac cgg cag gac tat 6732 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 1725 1730 1735 gag cta cag gac ttt ggt cct ggc tac agc gac gaa gag cca gac 6777 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp 1740 1745 1750 cct ggg agg gat gag gag gac ctg gcg gat gaa atg ata tgc atc 6822 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 1755 1760 1765 acc acc ttg tag cccccagcga ggggcagact ggctctggcc tcaggtgggg 6874 Thr Thr Leu 1770 cgcaggagag ccaggggaaa agtgcctcat agttaggaaa gtttaggcac tagttgggag 6934 taatattcaa ttaattagac ttttgtataa gagatgtcat gcctcaagaa agccataaac 6994 ctggtaggaa caggtcccaa gcggttgagc ctggcagagt accatgcgct cggccccagc 7054 tgcaggaaac agcaggcccc gccctctcac agaggatggg tgaggaggcc agacctgccc 7114 tgccccattg tccagatggg cactgctgtg gagtctgctt ctcccatgta ccagggcacc 7174 aggcccaccc aactgaaggc atggcggcgg ggtgcagggg aaagttaaag gtgatgacga 7234 tcatcacacc tgtgtcgtta cctcagccat cggtctagca tatcagtcac tgggcccaac 7294 atatccattt ttaaaccctt tcccccaaat acactgcgtc ctggttcctg tttagctgtt 7354 ctgaaata 7362 16 1772 PRT Homo sapiens misc_feature (383)..(383) The ′Xaa′ at location 383 stands for Glu, Asp, Gly, Ala, or Val. 
16 Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser Gly 1 5 10 15 Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp Arg 20 25 30 Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp Gly 35 40 45 Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp Asn 50 55 60 Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr Phe 65 70 75 80 Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile Ser 85 90 95 Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln Ile Gln Asp Met 100 105 110 Pro Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val Lys 115 120 125 Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn Arg 130 135 140 Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu Val 145 150 155 160 Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val Phe Arg Cys Val 165 170 175 Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu Ser 180 185 190 Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser Leu 195 200 205 Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly Met 210 215 220 Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys Arg 225 230 235 240 Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln Ile 245 250 255 Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met Ala 260 265 270 Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe Ile 275 280 285 Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu Ala 290 295 300 Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala Gln 305 310 315 320 Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys Glu 325 330 335 Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile Ala 340 345 350 Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg Glu Glu Asp Glu 355 360 365 Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Gly Met Val Xaa Xaa 370 375 380 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 385 390 395 400 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 405 410 415 Xaa Xaa Xaa Xaa 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 420 425 430 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 435 440 445 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 450 455 460 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 465 470 475 480 Xaa Xaa Pro Cys Arg Ile Arg Val Gly Cys His Lys Leu Ile Asn His 485 490 495 His Ile Phe Thr Asn Leu Ile Leu Val Phe Ile Met Leu Ser Ser Ala 500 505 510 Ala Leu Ala Ala Glu Asp Pro Ile Arg Ser His Ser Phe Arg Asn Thr 515 520 525 Ile Leu Gly Tyr Phe Asp Tyr Ala Phe Thr Ala Ile Phe Thr Val Glu 530 535 540 Ile Leu Leu Lys Met Thr Thr Phe Gly Ala Phe Leu His Lys Gly Ala 545 550 555 560 Phe Cys Arg Asn Tyr Phe Asn Leu Leu Asp Met Leu Val Val Gly Val 565 570 575 Ser Leu Val Ser Phe Gly Ile Gln Ser Ser Ala Ile Ser Val Val Lys 580 585 590 Ile Leu Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg 595 600 605 Ala Lys Gly Leu Lys His Val Val Gln Cys Val Phe Val Ala Ile Arg 610 615 620 Thr Ile Gly Asn Ile Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe 625 630 635 640 Ala Cys Ile Gly Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr 645 650 655 Asp Glu Ala Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu 660 665 670 Tyr Lys Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp 675 680 685 Gln Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 690 695 700 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr Lys 705 710 715 720 Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn His Arg 725 730 735 Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile Val Ala Phe 740 745 750 Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe Gln Glu 755 760 765 Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg 770 775 780 Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr Ile 785 790 795 800 Pro Lys Asn Pro Tyr Gln Tyr Lys Phe Trp Tyr Val Val Asn Ser Ser 805 810 815 Pro Phe Glu Tyr Met Met Phe Val Leu Ile Met Leu Asn Thr Leu Cys 820 825 830 Leu Ala Met Gln His 
Tyr Glu Gln Ser Lys Met Phe Asn Asp Ala Met 835 840 845 Asp Ile Leu Asn Met Val Phe Thr Gly Val Phe Thr Val Glu Met Val 850 855 860 Leu Lys Val Ile Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp 865 870 875 880 Asn Thr Phe Asp Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala 885 890 895 Leu Ser Glu Ala Lys Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr 900 905 910 Ala Thr Pro Gly Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe 915 920 925 Phe Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 930 935 940 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala 945 950 955 960 Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile Tyr Ala 965 970 975 Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met Arg Asp Asn Asn 980 985 990 Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln Ala Val Leu 995 1000 1005 Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp Gln Glu Ile Met 1010 1015 1020 Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro Glu Ser Asp Tyr 1025 1030 1035 Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn Phe Ala Ile Val 1040 1045 1050 Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu Ile Ile Asn 1055 1060 1065 Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr Leu Thr Arg 1070 1075 1080 Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu Phe Lys Arg 1085 1090 1095 Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly Arg Ile Lys His 1100 1105 1110 Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro Pro Leu Gly 1115 1120 1125 Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys Lys Arg Leu Val 1130 1135 1140 Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val Met Phe Asn 1145 1150 1155 Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys Ile Lys Thr 1160 1165 1170 Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu Arg Ala Val Ile 1175 1180 1185 Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu Leu Asp Gln Val 1190 1195 1200 Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr 1205 1210 1215 Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys Phe Lys Lys Arg 1220 1225 1230 Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala Lys 
Asn Thr Thr 1235 1240 1245 Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His Asp Ile Gly Pro 1250 1255 1260 Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln Asp Asp Glu Pro 1265 1270 1275 Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val Phe Lys Arg Asn 1280 1285 1290 Gly Ala Leu Leu Gly Asn His Val Asn His Val Asn Ser Asp Arg 1295 1300 1305 Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His Arg Pro Leu His 1310 1315 1320 Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp Thr Glu Lys Pro 1325 1330 1335 Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His Asn His His Asn 1340 1345 1350 His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser Thr Asn Ala Asn 1355 1360 1365 Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His Gly Lys Arg Pro 1370 1375 1380 Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn Gly His His Ser 1385 1390 1395 Ser His Lys His Asp Arg Glu Pro Gln Arg Arg Ser Ser Val Lys 1400 1405 1410 Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser Asp Ser Gly Asp 1415 1420 1425 Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro Glu Ile His Gly 1430 1435 1440 Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln Glu Tyr Phe Ser 1445 1450 1455 Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro Thr Trp Ser Arg 1460 1465 1470 Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly Arg Asn Ile Asp 1475 1480 1485 Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln Gly Phe Leu Glu 1490 1495 1500 Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg Arg Ser Pro Arg 1505 1510 1515 Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His Arg Arg Ser Ser 1520 1525 1530 Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser Gln Glu Glu Val 1535 1540 1545 Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala Leu Pro Leu His 1550 1555 1560 Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly Leu Asp Ser Ser 1565 1570 1575 Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr Arg Ser Trp Ala 1580 1585 1590 Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp Thr Pro Cys Tyr 1595 1600 1605 Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala Leu Asp Gln Val 1610 1615 1620 Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser Trp Tyr Thr Asp 1625 1630 1635 Glu Pro Asp Ile Ser 
Tyr Arg Thr Phe Thr Pro Ala Ser Leu Thr 1640 1645 1650 Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp Lys Gln Arg Ser 1655 1660 1665 Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser Glu Gly Leu Gly 1670 1675 1680 Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala Thr Lys His Glu 1685 1690 1695 Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu Met Glu Ser Ala 1700 1705 1710 Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro Arg Ala Asn Gly 1715 1720 1725 Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr Glu Leu Gln Asp 1730 1735 1740 Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp Pro Gly Arg Asp 1745 1750 1755 Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile Thr Thr Leu 1760 1765 1770 17 7193 DNA Homo sapiens gene (1)..(7193) 17 agaataaggg cagggaccgc ggctcctatc tcttggtgat ccccttcccc attccgcccc 60 cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtgg 118 atg atg atg atg atg atg atg aaa aaa atg cag cat caa cgg cag cag 166 Met Met Met Met Met Met Met Lys Lys Met Gln His Gln Arg Gln Gln 1 5 10 15 caa gcg gac cac gcg aac gag gca aac tat gca aga ggc acc aga ctt 214 Gln Ala Asp His Ala Asn Glu Ala Asn Tyr Ala Arg Gly Thr Arg Leu 20 25 30 cct ctt tct ggt gaa gga cca act tct cag ccg aat agc tcc aag caa 262 Pro Leu Ser Gly Glu Gly Pro Thr Ser Gln Pro Asn Ser Ser Lys Gln 35 40 45 act gtc ctg tct tgg caa gct gca atc gat gct gct aga cag gcc aag 310 Thr Val Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys 50 55 60 gct gcc caa act atg agc acc tct gca ccc cca cct gta gga tct ctc 358 Ala Ala Gln Thr Met Ser Thr Ser Ala Pro Pro Pro Val Gly Ser Leu 65 70 75 80 tcc caa aga aaa cgt cag caa tac gcc aag agc aaa aaa cag ggt aac 406 Ser Gln Arg Lys Arg Gln Gln Tyr Ala Lys Ser Lys Lys Gln Gly Asn 85 90 95 tcg tcc aac agc cga cct gcc cgc gcc ctt ttc tgt tta tca ctc aat 454 Ser Ser Asn Ser Arg Pro Ala Arg Ala Leu Phe Cys Leu Ser Leu Asn 100 105 110 aac ccc atc cga aga gcc tgc att agt ata gtg gaa tgg aaa cca ttt 502 Asn Pro Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe 115 120 125 gac ata ttt ata tta ttg gct 
att ttt gcc aat tgt gtg gcc tta gct 550 Asp Ile Phe Ile Leu Leu Ala Ile Phe Ala Asn Cys Val Ala Leu Ala 130 135 140 att tac atc cca ttc cct gaa gat gat tct aat tca aca aat cat aac 598 Ile Tyr Ile Pro Phe Pro Glu Asp Asp Ser Asn Ser Thr Asn His Asn 145 150 155 160 ttg gaa aaa gta gaa tat gcc ttc ctg att att ttt aca gtc gag aca 646 Leu Glu Lys Val Glu Tyr Ala Phe Leu Ile Ile Phe Thr Val Glu Thr 165 170 175 ttt ttg aag att ata gcg tat gga tta ttg cta cat cct aat gct tat 694 Phe Leu Lys Ile Ile Ala Tyr Gly Leu Leu Leu His Pro Asn Ala Tyr 180 185 190 gtt agg aat gga tgg aat tta ctg gat ttt gtt ata gta ata gta gga 742 Val Arg Asn Gly Trp Asn Leu Leu Asp Phe Val Ile Val Ile Val Gly 195 200 205 ttg ttt agt gta att ttg gaa caa tta acc aaa gaa aca gaa ggc ggg 790 Leu Phe Ser Val Ile Leu Glu Gln Leu Thr Lys Glu Thr Glu Gly Gly 210 215 220 aac cac tca agc ggc aaa tct gga ggc ttt gat gtc aaa gcc ctc cgt 838 Asn His Ser Ser Gly Lys Ser Gly Gly Phe Asp Val Lys Ala Leu Arg 225 230 235 240 gcc ttt cga gtg ttg cga cca ctt cga cta gtg tca ggg gtg ccc agt 886 Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser 245 250 255 tta caa gtt gtc ctg aac tcc att ata aaa gcc atg gtt ccc ctc ctt 934 Leu Gln Val Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu 260 265 270 cac ata gcc ctt ttg gta tta ttt gta atc ata atc tat gct att ata 982 His Ile Ala Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile 275 280 285 gga ttg gaa ctt ttt att gga aaa atg cac aaa aca tgt ttt ttt gct 1030 Gly Leu Glu Leu Phe Ile Gly Lys Met His Lys Thr Cys Phe Phe Ala 290 295 300 gac tca gat atc gta gct gaa gag gac cca gct cca tgt gcg ttc tca 1078 Asp Ser Asp Ile Val Ala Glu Glu Asp Pro Ala Pro Cys Ala Phe Ser 305 310 315 320 ggg aat gga cgc cag tgt act gcc aat ggc acg gaa tgt agg agt ggc 1126 Gly Asn Gly Arg Gln Cys Thr Ala Asn Gly Thr Glu Cys Arg Ser Gly 325 330 335 tgg gtt ggc ccg aac gga ggc atc acc aac ttt gat aac ttt gcc ttt 1174 Trp Val Gly Pro Asn Gly Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe 340 345 
350 gcc atg ctt act gtg ttt cag tgc atc acc atg gag ggc tgg aca gac 1222 Ala Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp 355 360 365 gtg ctc tac tgg gta aat gat gcg ata gga tgg gaa tgg cca tgg gtg 1270 Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val 370 375 380 tat ttt gtt agt ctg atc atc ctt ggc tca ttt ttc gtc ctt aac ctg 1318 Tyr Phe Val Ser Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu 385 390 395 400 gtt ctt ggt gtc ctt agt gga gaa ttc tca aag gaa aga gag aag gca 1366 Val Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala 405 410 415 aaa gca cgg gga gat ttc cag aag ctc cgg gag aag cag cag ctg gag 1414 Lys Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu 420 425 430 gag gat cta aag ggc tac ttg gat tgg atc acc caa gct gag gac atc 1462 Glu Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile 435 440 445 gat ccg gag aat gag gaa gaa gga gga gag gaa ggc aaa cga aat act 1510 Asp Pro Glu Asn Glu Glu Glu Gly Gly Glu Glu Gly Lys Arg Asn Thr 450 455 460 agc atg ccc acc agc gag act gag tct gtg aac aca gag aac gtc agc 1558 Ser Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser 465 470 475 480 ggt gaa ggc gag aac cga ggc tgc tgt gga agt ctc tgg tgc tgg tgg 1606 Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp 485 490 495 aga cgg aga ggc gcg gcc aag gcg ggg ccc tct ggg tgt cgg cgg tgg 1654 Arg Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp 500 505 510 ggt caa gcc atc tca aaa tcc aaa ctc agc cga cgc tgg cgt cgc tgg 1702 Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp 515 520 525 aac cga ttc aat cgc aga aga tgt agg gcc gcc gtg aag tct gtc acg 1750 Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr 530 535 540 ttt tac tgg ctg gtt atc gtc ctg gtg ttt ctg aac acc tta acc att 1798 Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile 545 550 555 560 tcc tct gag cac tac aat cag cca gat tgg ttg aca cag att caa gat 1846 Ser Ser Glu His Tyr Asn Gln 
Pro Asp Trp Leu Thr Gln Ile Gln Asp 565 570 575 att gcc aac aaa gtc ctc ttg gct ctg ttc acc tgc gag atg ctg gta 1894 Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val 580 585 590 aaa atg tac agc ttg ggc ctc caa gca tat ttc gtc tct ctt ttc aac 1942 Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn 595 600 605 cgg ttt gat tgc ttc gtg gtg tgt ggt gga atc act gag acg atc ctg 1990 Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu 610 615 620 gtg gaa ctg gaa atc atg tct ccc ctg ggg atc tct gtg ttt cgg tgt 2038 Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val Phe Arg Cys 625 630 635 640 gtg cgc ctc tta aga atc ttc aaa gtg acc agg cac tgg act tcc ctg 2086 Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu 645 650 655 agc aac tta gtg gca tcc tta tta aac tcc atg aag tcc atc gct tcg 2134 Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser 660 665 670 ctg ttg ctt ctg ctt ttt ctc ttc att atc atc ttt tcc ttg ctt ggg 2182 Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly 675 680 685 atg cag ctg ttt ggc ggc aag ttt aat ttt gat gaa acg caa acc aag 2230 Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys 690 695 700 cgg agc acc ttt gac aat ttc cct caa gca ctt ctc aca gtg ttc cag 2278 Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln 705 710 715 720 atc ctg aca ggc gaa gac tgg aat gct gtg atg tac gat ggc atc atg 2326 Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met 725 730 735 gct tac ggg ggc cca tcc tct tca gga atg atc gtc tgc atc tac ttc 2374 Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe 740 745 750 atc atc ctc ttc att tgt ggt aac tat att cta ctg aat gtc ttc ttg 2422 Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu 755 760 765 gcc atc gct gta gac aat ttg gct gat gct gaa agt ctg aac act gct 2470 Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala 770 775 780 cag aaa gaa gaa gcg gaa gaa aag gag agg aaa aag att gcc 
aga aaa 2518 Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys 785 790 795 800 gag agc cta gaa aat aaa aag aac aac aaa cca gaa gtc aac cag ata 2566 Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile 805 810 815 gcc aac agt gac aac aag gtt aca att gat gac tat aga gaa gag gat 2614 Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg Glu Glu Asp 820 825 830 gaa gac aag gac ccc tat ccg cct tgc gat gtg cca gta ggg gaa gag 2662 Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Val Gly Glu Glu 835 840 845 gaa gag gaa gag gag gag gat gaa cct gag gtt cct gcc gga ccc cgt 2710 Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro Ala Gly Pro Arg 850 855 860 cct cga agg atc tcg gag ttg aac atg aag gaa aaa att gcc ccc atc 2758 Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys Ile Ala Pro Ile 865 870 875 880 cct gaa ggg agc gct ttc ttc att ctt agc aag acc aac ccg atc cgc 2806 Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr Asn Pro Ile Arg 885 890 895 gta ggc tgc cac aag ctc atc aac cac cac atc ttc acc aac ctc atc 2854 Val Gly Cys His Lys Leu Ile Asn His His Ile Phe Thr Asn Leu Ile 900 905 910 ctt gtc ttc atc atg ctg agc agc gct gcc ctg gcc gca gag gac ccc 2902 Leu Val Phe Ile Met Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro 915 920 925 atc cgc agc cac tcc ttc cgg aac acg ata ctg ggt tac ttt gac tat 2950 Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr 930 935 940 gcc ttc aca gcc atc ttt act gtt gag atc ctg ttg aag atg aca act 2998 Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr 945 950 955 960 ttt gga gct ttc ctc cac aaa ggg gcc ttc tgc agg aac tac ttc aat 3046 Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn 965 970 975 ttg ctg gat atg ctg gtg gtt ggg gtg tct ctg gtg tca ttt ggg att 3094 Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val Ser Phe Gly Ile 980 985 990 caa tcc agt gcc atc tcc gtt gtg aag att ctg agg gtc tta agg gtc 3142 Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 995 1000 1005 ctg cgt ccc 
ctc agg gcc atc aac aga gca aaa gga ctt aag cac 3187 Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 1010 1015 1020 gtg gtc cag tgc gtc ttc gtg gcc atc cgg acc atc ggc aac atc 3232 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile 1025 1030 1035 atg atc gtc act acc ctc ctg cag ttc atg ttt gcc tgt atc ggg 3277 Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 1040 1045 1050 gtc cag ttg ttc aag ggg aag ttc tat cgc tgt acg gat gaa gcc 3322 Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 1055 1060 1065 aaa agt aac cct gaa gaa tgc agg gga ctt ttc atc ctc tac aag 3367 Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys 1070 1075 1080 gat ggg gat gtt gac agt cct gtg gtc cgt gaa cgg atc tgg caa 3412 Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln 1085 1090 1095 aac agt gat ttc aac ttc gac aac gtc ctc tct gct atg atg gcg 3457 Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 1100 1105 1110 ctc ttc aca gtc tcc acg ttt gag ggc tgg cct gcg ttg ctg tat 3502 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr 1115 1120 1125 aaa gcc atc gac tcg aat gga gag aac atc ggc cca atc tac aac 3547 Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn 1130 1135 1140 cac cgc gtg gag atc tcc atc ttc ttc atc atc tac atc atc att 3592 His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile 1145 1150 1155 gta gct ttc ttc atg atg aac atc ttt gtg ggc ttt gtc atc gtt 3637 Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 1160 1165 1170 aca ttt cag gaa caa gga gaa aaa gag tat aag aac tgt gag ctg 3682 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu 1175 1180 1185 gac aaa aat cag cgt cag tgt gtt gaa tac gcc ttg aaa gca cgt 3727 Asp Lys Asn Gln Arg Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg 1190 1195 1200 ccc ttg cgg aga tac atc ccc aaa aac ccc tac cag tac aag ttc 3772 Pro Leu Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe 1205 1210 1215 tgg tac gtg gtg aac tct tcg cct ttc gaa tac atg atg 
ttt gtc 3817 Trp Tyr Val Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val 1220 1225 1230 ctc atc atg ctc aac aca ctc tgc ttg gcc atg cag cac tac gag 3862 Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu 1235 1240 1245 cag tcc aag atg ttc aat gat gcc atg gac att ctg aac atg gtc 3907 Gln Ser Lys Met Phe Asn Asp Ala Met Asp Ile Leu Asn Met Val 1250 1255 1260 ttc acc ggg gtg ttc acc gtc gag atg gtt ttg aaa gtc atc gca 3952 Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val Ile Ala 1265 1270 1275 ttt aag cct aag ggg tat ttt agt gac gcc tgg aac acg ttt gac 3997 Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe Asp 1280 1285 1290 tcc ctc atc gta atc ggc agc att ata gac gtg gcc ctc agc gaa 4042 Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu 1295 1300 1305 gcg gac cca act gaa agt gaa aat gtc cct gtc cca act gct aca 4087 Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr 1310 1315 1320 cct ggg aac tct gaa gag agc aat aga atc tcc atc acc ttt ttc 4132 Pro Gly Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe 1325 1330 1335 cgt ctt ttc cga gtg atg cga ttg gtg aag ctt ctc agc agg ggg 4177 Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 1340 1345 1350 gaa ggc atc cgg aca ttg ctg tgg act ttt att aag tcc ttt cag 4222 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln 1355 1360 1365 gcg ctc ccg tat gtg gcc ctc ctc ata gcc atg ctg ttc ttc atc 4267 Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile 1370 1375 1380 tat gcg gtc att ggc atg cag atg ttt ggg aaa gtt gcc atg aga 4312 Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met Arg 1385 1390 1395 gat aac aac cag atc aat agg aac aat aac ttc cag acg ttt ccc 4357 Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 1400 1405 1410 cag gcg gtg ctg ctg ctc ttc agg tgt gca aca ggt gag gcc tgg 4402 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1415 1420 1425 cag gag atc atg ctg gcc tgt ctc cca ggg aag ctc tgt gac cct 4447 Gln Glu Ile Met Leu Ala 
Cys Leu Pro Gly Lys Leu Cys Asp Pro 1430 1435 1440 gag tca gat tac aac ccc ggg gag gag tat aca tgt ggg agc aac 4492 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1445 1450 1455 ttt gcc att gtc tat ttc atc agt ttt tac atg ctc tgt gca ttt 4537 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1460 1465 1470 ctg atc atc aat ctg ttt gtg gct gtc atc atg gat aat ttc gac 4582 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1475 1480 1485 tat ctg acc cgg gac tgg tct att ttg ggg cct cac cat tta gat 4627 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1490 1495 1500 gaa ttc aaa aga ata tgg tca gaa tat gac cct gag gca aag gga 4672 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1505 1510 1515 agg ata aaa cac ctt gat gtg gtc act ctg ctt cga cgc atc cag 4717 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1520 1525 1530 cct ccc ctg ggg ttt ggg aag tta tgt cca cac agg gta gcg tgc 4762 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1535 1540 1545 aag aga tta gtt gcc atg aac atg cct ctc aac agt gac ggg aca 4807 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1550 1555 1560 gtc atg ttt aat gca acc ctg ttt gct ttg gtt cga acg gct ctt 4852 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1565 1570 1575 aag atc aag acc gaa ggg aac ctg gag caa gct aat gaa gaa ctt 4897 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1580 1585 1590 cgg gct gtg ata aag aaa att tgg aag aaa acc agc atg aaa tta 4942 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1595 1600 1605 ctt gac caa gtt gtc cct cca gct ggt gat gat gag gta acc gtg 4987 Leu Asp Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1610 1615 1620 ggg aag ttc tat gcc act ttc ctg ata cag gac tac ttt agg aaa 5032 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1625 1630 1635 ttc aag aaa cgg aaa gaa caa gga ctg gtg gga aag tac cct gcg 5077 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 
1640 1645 1650 aag aac acc aca att gcc cta cag gcg gga tta agg aca ctg cat 5122 Lys Asn Thr Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1655 1660 1665 gac att ggg cca gaa atc cgg cgt gct ata tcg tgt gat ttg caa 5167 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1670 1675 1680 gat gac gag cct gag gaa aca aaa cga gaa gaa gaa gat gat gtg 5212 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1685 1690 1695 ttc aaa aga aat ggt gcc ctg ctt gga aac cat gtc aat cat gtt 5257 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1700 1705 1710 aat agt gat agg aga gat tcc ctt cag cag acc aat acc acc cac 5302 Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His 1715 1720 1725 cgt ccc ctg cat gtc caa agg cct tca att cca cct gca agt gat 5347 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1730 1735 1740 act gag aaa ccg ctg ttt cct cca gca gga aat tcg gtg tgt cat 5392 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1745 1750 1755 aac cat cat aac cat aat tcc ata gga aag caa gtt ccc acc tca 5437 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1760 1765 1770 aca aat gcc aat ctc aat aat gcc aat atg tcc aaa gct gcc cat 5482 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1775 1780 1785 gga aag cgg ccc agc att ggg aac ctt gag cat gtg tct gaa aat 5527 Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1790 1795 1800 ggg cat cat tct tcc cac aag cat gac cgg gag cct cag aga agg 5572 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1805 1810 1815 tcc agt gtg aaa aga acc cgc tat tat gaa act tac att agg tcc 5617 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1820 1825 1830 gac tca gga gat gaa cag ctc cca act att tgc cgg gaa gac cca 5662 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1835 1840 1845 gag ata cat ggc tat ttc agg gac ccc cac tgc ttg ggg gag cag 5707 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1850 1855 1860 gag tat ttc agt agt gag 
gaa tgc tac gag gat gac agc tcg ccc 5752 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro 1865 1870 1875 acc tgg agc agg caa aac tat ggc tac tac agc aga tac cca ggc 5797 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1880 1885 1890 aga aac atc gac tct gag agg ccc cga ggc tac cat cat ccc caa 5842 Arg Asn Ile Asp Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1895 1900 1905 gga ttc ttg gag gac gat gac tcg ccc gtt tgc tat gat tca cgg 5887 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1910 1915 1920 aga tct cca agg aga cgc cta cta cct ccc acc cca gca tcc cac 5932 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1925 1930 1935 cgg aga tcc tcc ttc aac ttt gag tgc ctg cgc cgg cag agc agc 5977 Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1940 1945 1950 cag gaa gag gtc ccg tcg tct ccc atc ttc ccc cat cgc acg gcc 6022 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1955 1960 1965 ctg cct ctg cat cta atg cag caa cag atc atg gca gtt gcc ggc 6067 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1970 1975 1980 cta gat tca agt aaa gcc cag aag tac tca ccg agt cac tcg acc 6112 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1985 1990 1995 cgg tcg tgg gcc acc cct cca gca acc cct ccc tac cgg gac tgg 6157 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp 2000 2005 2010 aca ccg tgc tac acc ccc ctg atc caa gtg gag cag tca gag gcc 6202 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 2015 2020 2025 ctg gac cag gtg aac ggc agc ctg ccg tcc ctg cac cgc agc tcc 6247 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 2030 2035 2040 tgg tac aca gac gag ccc gac atc tcc tac cgg act ttc aca cca 6292 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 2045 2050 2055 gcc agc ctg act gtc ccc agc agc ttc cgg aac aaa aac agc gac 6337 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 2060 2065 2070 aag cag agg agt gcg gac agc ttg gtg gag gca gtc ctg ata tcc 
6382 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 2075 2080 2085 gaa ggc ttg gga cgc tat gca agg gac cca aaa ttt gtg tca gca 6427 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 2090 2095 2100 aca aaa cac gaa atc gct gat gcc tgt gac ctc acc atc gac gag 6472 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu 2105 2110 2115 atg gag agt gca gcc agc acc ctg ctt aat ggg aac gtg cgt ccc 6517 Met Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 2120 2125 2130 cga gcc aac ggg gat gtg ggc ccc ctc tca cac cgg cag gac tat 6562 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 2135 2140 2145 gag cta cag gac ttt ggt cct ggc tac agc gac gaa gag cca gac 6607 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp 2150 2155 2160 cct ggg agg gat gag gag gac ctg gcg gat gaa atg ata tgc atc 6652 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 2165 2170 2175 acc acc ttg tag cccccagcga ggggcagact ggctctggcc tcaggtgggg 6704 Thr Thr Leu 2180 cgcaggagag ccaggggaaa agtgcctcat agttaggaaa gtttaggcac tagttgggag 6764 taatattcaa ttaattagac ttttgtataa gagatgtcat gcctcaagaa agccataaac 6824 ctggtaggaa caggtcccaa gcggttgagc ctggcagagt accatgcgct cggccccagc 6884 tgcaggaaac agcaggcccc gccctctcac agaggatggg tgaggaggcc agacctgccc 6944 tgccccattg tccagatggg cactgctgtg gagtctgctt ctcccatgta ccagggcacc 7004 aggcccaccc aactgaaggc atggcggcgg ggtgcagggg aaagttaaag gtgatgacga 7064 tcatcacacc tcgtgtcgtt acctcagcca tcggtctagc atatcagtca ctgggcccaa 7124 catatccatt tttaaaccct ttcccccaaa tacactgcgt cctggttcct gtttagctgt 7184 tctgaaata 7193 18 2181 PRT Homo sapiens 18 Met Met Met Met Met Met Met Lys Lys Met Gln His Gln Arg Gln Gln 1 5 10 15 Gln Ala Asp His Ala Asn Glu Ala Asn Tyr Ala Arg Gly Thr Arg Leu 20 25 30 Pro Leu Ser Gly Glu Gly Pro Thr Ser Gln Pro Asn Ser Ser Lys Gln 35 40 45 Thr Val Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys 50 55 60 Ala Ala Gln Thr Met Ser Thr Ser Ala Pro Pro Pro Val Gly Ser Leu 65 70 75 80 Ser Gln Arg Lys 
Arg Gln Gln Tyr Ala Lys Ser Lys Lys Gln Gly Asn 85 90 95 Ser Ser Asn Ser Arg Pro Ala Arg Ala Leu Phe Cys Leu Ser Leu Asn 100 105 110 Asn Pro Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe 115 120 125 Asp Ile Phe Ile Leu Leu Ala Ile Phe Ala Asn Cys Val Ala Leu Ala 130 135 140 Ile Tyr Ile Pro Phe Pro Glu Asp Asp Ser Asn Ser Thr Asn His Asn 145 150 155 160 Leu Glu Lys Val Glu Tyr Ala Phe Leu Ile Ile Phe Thr Val Glu Thr 165 170 175 Phe Leu Lys Ile Ile Ala Tyr Gly Leu Leu Leu His Pro Asn Ala Tyr 180 185 190 Val Arg Asn Gly Trp Asn Leu Leu Asp Phe Val Ile Val Ile Val Gly 195 200 205 Leu Phe Ser Val Ile Leu Glu Gln Leu Thr Lys Glu Thr Glu Gly Gly 210 215 220 Asn His Ser Ser Gly Lys Ser Gly Gly Phe Asp Val Lys Ala Leu Arg 225 230 235 240 Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser 245 250 255 Leu Gln Val Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu 260 265 270 His Ile Ala Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile 275 280 285 Gly Leu Glu Leu Phe Ile Gly Lys Met His Lys Thr Cys Phe Phe Ala 290 295 300 Asp Ser Asp Ile Val Ala Glu Glu Asp Pro Ala Pro Cys Ala Phe Ser 305 310 315 320 Gly Asn Gly Arg Gln Cys Thr Ala Asn Gly Thr Glu Cys Arg Ser Gly 325 330 335 Trp Val Gly Pro Asn Gly Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe 340 345 350 Ala Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp 355 360 365 Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val 370 375 380 Tyr Phe Val Ser Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu 385 390 395 400 Val Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala 405 410 415 Lys Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu 420 425 430 Glu Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile 435 440 445 Asp Pro Glu Asn Glu Glu Glu Gly Gly Glu Glu Gly Lys Arg Asn Thr 450 455 460 Ser Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser 465 470 475 480 Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp 485 490 495 Arg Arg Arg Gly Ala 
Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp 500 505 510 Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp 515 520 525 Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr 530 535 540 Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile 545 550 555 560 Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln Ile Gln Asp 565 570 575 Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val 580 585 590 Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn 595 600 605 Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu 610 615 620 Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val Phe Arg Cys 625 630 635 640 Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu 645 650 655 Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser 660 665 670 Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly 675 680 685 Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys 690 695 700 Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln 705 710 715 720 Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met 725 730 735 Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe 740 745 750 Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu 755 760 765 Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala 770 775 780 Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys 785 790 795 800 Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile 805 810 815 Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg Glu Glu Asp 820 825 830 Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Val Gly Glu Glu 835 840 845 Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro Ala Gly Pro Arg 850 855 860 Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys Ile Ala Pro Ile 865 870 875 880 Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr Asn Pro Ile Arg 885 890 895 Val Gly Cys His Lys Leu Ile Asn His His Ile Phe Thr Asn Leu Ile 900 905 910 Leu Val Phe Ile Met Leu 
Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro 915 920 925 Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr 930 935 940 Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr 945 950 955 960 Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn 965 970 975 Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val Ser Phe Gly Ile 980 985 990 Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 995 1000 1005 Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 1010 1015 1020 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile 1025 1030 1035 Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 1040 1045 1050 Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 1055 1060 1065 Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys 1070 1075 1080 Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln 1085 1090 1095 Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 1100 1105 1110 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr 1115 1120 1125 Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn 1130 1135 1140 His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile 1145 1150 1155 Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 1160 1165 1170 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu 1175 1180 1185 Asp Lys Asn Gln Arg Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg 1190 1195 1200 Pro Leu Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe 1205 1210 1215 Trp Tyr Val Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val 1220 1225 1230 Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu 1235 1240 1245 Gln Ser Lys Met Phe Asn Asp Ala Met Asp Ile Leu Asn Met Val 1250 1255 1260 Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val Ile Ala 1265 1270 1275 Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe Asp 1280 1285 1290 Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu 1295 1300 1305 Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr 
1310 1315 1320 Pro Gly Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe 1325 1330 1335 Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 1340 1345 1350 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln 1355 1360 1365 Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile 1370 1375 1380 Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met Arg 1385 1390 1395 Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 1400 1405 1410 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1415 1420 1425 Gln Glu Ile Met Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro 1430 1435 1440 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1445 1450 1455 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1460 1465 1470 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1475 1480 1485 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1490 1495 1500 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1505 1510 1515 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1520 1525 1530 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1535 1540 1545 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1550 1555 1560 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1565 1570 1575 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1580 1585 1590 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1595 1600 1605 Leu Asp Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1610 1615 1620 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1625 1630 1635 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 1640 1645 1650 Lys Asn Thr Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1655 1660 1665 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1670 1675 1680 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1685 1690 1695 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1700 1705 1710 Asn Ser Asp Arg Arg Asp Ser Leu 
Gln Gln Thr Asn Thr Thr His 1715 1720 1725 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1730 1735 1740 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1745 1750 1755 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1760 1765 1770 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1775 1780 1785 Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1790 1795 1800 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1805 1810 1815 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1820 1825 1830 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1835 1840 1845 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1850 1855 1860 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro 1865 1870 1875 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1880 1885 1890 Arg Asn Ile Asp Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1895 1900 1905 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1910 1915 1920 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1925 1930 1935 Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1940 1945 1950 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1955 1960 1965 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1970 1975 1980 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1985 1990 1995 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp 2000 2005 2010 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 2015 2020 2025 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 2030 2035 2040 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 2045 2050 2055 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 2060 2065 2070 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 2075 2080 2085 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 2090 2095 2100 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu 2105 2110 2115 Met 
Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 2120 2125 2130 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 2135 2140 2145 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp 2150 2155 2160 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 2165 2170 2175 Thr Thr Leu 2180 19 7193 DNA Homo sapiens gene (1)..(7193) 19 agaataaggg cagggaccgc ggctcctatc tcttggtgat ccccttcccc attccgcccc 60 cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtgg 118 atg atg atg atg atg atg atg aaa aaa atg cag cat caa cgg cag cag 166 Met Met Met Met Met Met Met Lys Lys Met Gln His Gln Arg Gln Gln 1 5 10 15 caa gcg gac cac gcg aac gag gca aac tat gca aga ggc acc aga ctt 214 Gln Ala Asp His Ala Asn Glu Ala Asn Tyr Ala Arg Gly Thr Arg Leu 20 25 30 cct ctt tct ggt gaa gga cca act tct cag ccg aat agc tcc aag caa 262 Pro Leu Ser Gly Glu Gly Pro Thr Ser Gln Pro Asn Ser Ser Lys Gln 35 40 45 act gtc ctg tct tgg caa gct gca atc gat gct gct aga cag gcc aag 310 Thr Val Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys 50 55 60 gct gcc caa act atg agc acc tct gca ccc cca cct gta gga tct ctc 358 Ala Ala Gln Thr Met Ser Thr Ser Ala Pro Pro Pro Val Gly Ser Leu 65 70 75 80 tcc caa aga aaa cgt cag caa tac gcc aag agc aaa aaa cag ggt aac 406 Ser Gln Arg Lys Arg Gln Gln Tyr Ala Lys Ser Lys Lys Gln Gly Asn 85 90 95 tcg tcc aac agc cga cct gcc cgc gcc ctt ttc tgt tta tca ctc aat 454 Ser Ser Asn Ser Arg Pro Ala Arg Ala Leu Phe Cys Leu Ser Leu Asn 100 105 110 aac ccc atc cga aga gcc tgc att agt ata gtg gaa tgg aaa cca ttt 502 Asn Pro Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe 115 120 125 gac ata ttt ata tta ttg gct att ttt gcc aat tgt gtg gcc tta gct 550 Asp Ile Phe Ile Leu Leu Ala Ile Phe Ala Asn Cys Val Ala Leu Ala 130 135 140 att tac atc cca ttc cct gaa gat gat tct aat tca aca aat cat aac 598 Ile Tyr Ile Pro Phe Pro Glu Asp Asp Ser Asn Ser Thr Asn His Asn 145 150 155 160 ttg gaa aaa gta gaa tat gcc ttc ctg att att ttt aca gtc gag aca 646 Leu 
Glu Lys Val Glu Tyr Ala Phe Leu Ile Ile Phe Thr Val Glu Thr 165 170 175 ttt ttg aag att ata gcg tat gga tta ttg cta cat cct aat gct tat 694 Phe Leu Lys Ile Ile Ala Tyr Gly Leu Leu Leu His Pro Asn Ala Tyr 180 185 190 gtt agg aat gga tgg aat tta ctg gat ttt gtt ata gta ata gta gga 742 Val Arg Asn Gly Trp Asn Leu Leu Asp Phe Val Ile Val Ile Val Gly 195 200 205 ttg ttt agt gta att ttg gaa caa tta acc aaa gaa aca gaa ggc ggg 790 Leu Phe Ser Val Ile Leu Glu Gln Leu Thr Lys Glu Thr Glu Gly Gly 210 215 220 aac cac tca agc ggc aaa tct gga ggc ttt gat gtc aaa gcc ctc cgt 838 Asn His Ser Ser Gly Lys Ser Gly Gly Phe Asp Val Lys Ala Leu Arg 225 230 235 240 gcc ttt cga gtg ttg cga cca ctt cga cta gtg tca ggg gtg ccc agt 886 Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser 245 250 255 tta caa gtt gtc ctg aac tcc att ata aaa gcc atg gtt ccc ctc ctt 934 Leu Gln Val Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu 260 265 270 cac ata gcc ctt ttg gta tta ttt gta atc ata atc tat gct att ata 982 His Ile Ala Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile 275 280 285 gga ttg gaa ctt ttt att gga aaa atg cac aaa aca tgt ttt ttt gct 1030 Gly Leu Glu Leu Phe Ile Gly Lys Met His Lys Thr Cys Phe Phe Ala 290 295 300 gac tca gat atc gta gct gaa gag gac cca gct cca tgt gcg ttc tca 1078 Asp Ser Asp Ile Val Ala Glu Glu Asp Pro Ala Pro Cys Ala Phe Ser 305 310 315 320 ggg aat gga cgc cag tgt act gcc aat ggc acg gaa tgt agg agt ggc 1126 Gly Asn Gly Arg Gln Cys Thr Ala Asn Gly Thr Glu Cys Arg Ser Gly 325 330 335 tgg gtt ggc ccg aac gga ggc atc acc aac ttt gat aac ttt gcc ttt 1174 Trp Val Gly Pro Asn Gly Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe 340 345 350 gcc atg ctt act gtg ttt cag tgc atc acc atg gag ggc tgg aca gac 1222 Ala Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp 355 360 365 gtg ctc tac tgg gta aat gat gcg ata gga tgg gaa tgg cca tgg gtg 1270 Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val 370 375 380 tat ttt gtt agt ctg atc atc ctt ggc tca 
ttt ttc gtc ctt aac ctg 1318 Tyr Phe Val Ser Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu 385 390 395 400 gtt ctt ggt gtc ctt agt gga gaa ttc tca aag gaa aga gag aag gca 1366 Val Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala 405 410 415 aaa gca cgg gga gat ttc cag aag ctc cgg gag aag cag cag ctg gag 1414 Lys Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu 420 425 430 gag gat cta aag ggc tac ttg gat tgg atc acc caa gct gag gac atc 1462 Glu Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile 435 440 445 gat ccg gag aat gag gaa gaa gga gga gag gaa ggc aaa cga aat act 1510 Asp Pro Glu Asn Glu Glu Glu Gly Gly Glu Glu Gly Lys Arg Asn Thr 450 455 460 agc atg ccc acc agc gag act gag tct gtg aac aca gag aac gtc agc 1558 Ser Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser 465 470 475 480 ggt gaa ggc gag aac cga ggc tgc tgt gga agt ctc tgg tgc tgg tgg 1606 Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp 485 490 495 aga cgg aga ggc gcg gcc aag gcg ggg ccc tct ggg tgt cgg cgg tgg 1654 Arg Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp 500 505 510 ggt caa gcc atc tca aaa tcc aaa ctc agc cga cgc tgg cgt cgc tgg 1702 Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp 515 520 525 aac cga ttc aat cgc aga aga tgt agg gcc gcc gtg aag tct gtc acg 1750 Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr 530 535 540 ttt tac tgg ctg gtt atc gtc ctg gtg ttt ctg aac acc tta acc att 1798 Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile 545 550 555 560 tcc tct gag cac tac aat cag cca gat tgg ttg aca cag att caa gat 1846 Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln Ile Gln Asp 565 570 575 att gcc aac aaa gtc ctc ttg gct ctg ttc acc tgc gag atg ctg gta 1894 Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val 580 585 590 aaa atg tac agc ttg ggc ctc caa gca tat ttc gtc tct ctt ttc aac 1942 Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn 595 600 
605 cgg ttt gat tgc ttc gtg gtg tgt ggt gga atc act gag acg atc ctg 1990 Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu 610 615 620 gtg gaa ctg gaa atc atg tct ccc ctg ggg atc tct gtg ttt cgg tgt 2038 Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val Phe Arg Cys 625 630 635 640 gtg cgc ctc tta aga atc ttc aaa gtg acc agg cac tgg act tcc ctg 2086 Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu 645 650 655 agc aac tta gtg gca tcc tta tta aac tcc atg aag tcc atc gct tcg 2134 Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser 660 665 670 ctg ttg ctt ctg ctt ttt ctc ttc att atc atc ttt tcc ttg ctt ggg 2182 Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly 675 680 685 atg cag ctg ttt ggc ggc aag ttt aat ttt gat gaa acg caa acc aag 2230 Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys 690 695 700 cgg agc acc ttt gac aat ttc cct caa gca ctt ctc aca gtg ttc cag 2278 Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln 705 710 715 720 atc ctg aca ggc gaa gac tgg aat gct gtg atg tac gat ggc atc atg 2326 Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met 725 730 735 gct tac ggg ggc cca tcc tct tca gga atg atc gtc tgc atc tac ttc 2374 Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe 740 745 750 atc atc ctc ttc att tgt ggt aac tat att cta ctg aat gtc ttc ttg 2422 Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu 755 760 765 gcc atc gct gta gac aat ttg gct gat gct gaa agt ctg aac act gct 2470 Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala 770 775 780 cag aaa gaa gaa gcg gaa gaa aag gag agg aaa aag att gcc aga aaa 2518 Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys 785 790 795 800 gag agc cta gaa aat aaa aag aac aac aaa cca gaa gtc aac cag ata 2566 Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile 805 810 815 gcc aac agt gac aac aag gtt aca att gat gac tat aga gaa gag gat 2614 Ala Asn Ser Asp Asn Lys Val 
Thr Ile Asp Asp Tyr Arg Glu Glu Asp 820 825 830 gaa gac aag gac ccc tat ccg cct tgc gat gtg cca gta ggg gaa gag 2662 Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Val Gly Glu Glu 835 840 845 gaa gag gaa gag gag gag gat gaa cct gag gtt cct gcc gga ccc cgt 2710 Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro Ala Gly Pro Arg 850 855 860 cct cga agg atc tcg gag ttg aac atg aag gaa aaa att gcc ccc atc 2758 Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys Ile Ala Pro Ile 865 870 875 880 cct gaa ggg agc gct ttc ttc att ctt agc aag acc aac ccg atc cgc 2806 Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr Asn Pro Ile Arg 885 890 895 gta ggc tgc cac aag ctc atc aac cac cac atc ttc acc aac ctc atc 2854 Val Gly Cys His Lys Leu Ile Asn His His Ile Phe Thr Asn Leu Ile 900 905 910 ctt gtc ttc atc atg ctg agc agc gct gcc ctg gcc gca gag gac ccc 2902 Leu Val Phe Ile Met Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro 915 920 925 atc cgc agc cac tcc ttc cgg aac acg ata ctg ggt tac ttt gac tat 2950 Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr 930 935 940 gcc ttc aca gcc atc ttt act gtt gag atc ctg ttg aag atg aca act 2998 Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr 945 950 955 960 ttt gga gct ttc ctc cac aaa ggg gcc ttc tgc agg aac tac ttc aat 3046 Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn 965 970 975 ttg ctg gat atg ctg gtg gtt ggg gtg tct ctg gtg tca ttt ggg att 3094 Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val Ser Phe Gly Ile 980 985 990 caa tcc agt gcc atc tcc gtt gtg aag att ctg agg gtc tta agg gtc 3142 Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 995 1000 1005 ctg cgt ccc ctc agg gcc atc aac aga gca aaa gga ctt aag cac 3187 Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 1010 1015 1020 gtg gtc cag tgc gtc ttc gtg gcc atc cgg acc atc ggc aac atc 3232 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile 1025 1030 1035 atg atc gtc act acc ctc ctg cag ttc atg ttt gcc tgt atc ggg 3277 
Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 1040 1045 1050 gtc cag ttg ttc aag ggg aag ttc tat cgc tgt acg gat gaa gcc 3322 Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 1055 1060 1065 aaa agt aac cct gaa gaa tgc agg gga ctt ttc atc ctc tac aag 3367 Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys 1070 1075 1080 gat ggg gat gtt gac agt cct gtg gtc cgt gaa cgg atc tgg caa 3412 Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln 1085 1090 1095 aac agt gat ttc aac ttc gac aac gtc ctc tct gct atg atg gcg 3457 Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 1100 1105 1110 ctc ttc aca gtc tcc acg ttt gag ggc tgg cct gcg ttg ctg tat 3502 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr 1115 1120 1125 aaa gcc atc gac tcg aat gga gag aac atc ggc cca atc tac aac 3547 Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn 1130 1135 1140 cac cgc gtg gag atc tcc atc ttc ttc atc atc tac atc atc att 3592 His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile 1145 1150 1155 gta gct ttc ttc atg atg aac atc ttt gtg ggc ttt gtc atc gtt 3637 Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 1160 1165 1170 aca ttt cag gaa caa gga gaa aaa gag tat aag aac tgt gag ctg 3682 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu 1175 1180 1185 gac aaa aat cag cgt cag tgt gtt gaa tac gcc ttg aaa gca cgt 3727 Asp Lys Asn Gln Arg Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg 1190 1195 1200 ccc ttg cgg aga tac atc ccc aaa aac ccc tac cag tac aag ttc 3772 Pro Leu Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe 1205 1210 1215 tgg tac gtg gtg aac tct tcg cct ttc gaa tac atg atg ttt gtc 3817 Trp Tyr Val Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val 1220 1225 1230 ctc atc atg ctc aac aca ctc tgc ttg gcc atg cag cac tac gag 3862 Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu 1235 1240 1245 cag tcc aag atg ttc aat gat gcc atg gac att ctg aac atg gtc 3907 Gln Ser Lys Met Phe Asn Asp Ala Met Asp 
Ile Leu Asn Met Val 1250 1255 1260 ttc acc ggg gtg ttc acc gtc gag atg gtt ttg aaa gtc atc gca 3952 Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val Ile Ala 1265 1270 1275 ttt aag cct aag ggg tat ttt agt gac gcc tgg aac acg ttt gac 3997 Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe Asp 1280 1285 1290 tcc ctc atc gta atc ggc agc att ata gac gtg gcc ctc agc gaa 4042 Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu 1295 1300 1305 gcg gac cca act gaa agt gaa aat gtc cct gtc cca act gct aca 4087 Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr 1310 1315 1320 cct ggg aac tct gaa gag agc aat aga atc tcc atc acc ttt ttc 4132 Pro Gly Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe 1325 1330 1335 cgt ctt ttc cga gtg atg cga ttg gtg aag ctt ctc agc agg ggg 4177 Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 1340 1345 1350 gaa ggc atc cgg aca ttg ctg tgg act ttt att aag tcc ttt cag 4222 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln 1355 1360 1365 gcg ctc ccg tat gtg gcc ctc ctc ata gcc atg ctg ttc ttc atc 4267 Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile 1370 1375 1380 tat gcg gtc att ggc atg cag atg ttt ggg aaa gtt gcc atg aga 4312 Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Val Ala Met Arg 1385 1390 1395 gat aac aac cag atc aat agg aac aat aac ttc cag acg ttt ccc 4357 Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 1400 1405 1410 cag gcg gtg ctg ctg ctc ttc agg tgt gca aca ggt gag gcc tgg 4402 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1415 1420 1425 cag gag atc atg ctg gcc tgt ctc cca ggg aag ctc tgt gac cct 4447 Gln Glu Ile Met Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro 1430 1435 1440 gag tca gat tac aac ccc ggg gag gag tat aca tgt ggg agc aac 4492 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1445 1450 1455 ttt gcc att gtc tat ttc atc agt ttt tac atg ctc tgt gca ttt 4537 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1460 1465 1470 ctg 
atc atc aat ctg ttt gtg gct gtc atc atg gat aat ttc gac 4582 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1475 1480 1485 tat ctg acc cgg gac tgg tct att ttg ggg cct cac cat tta gat 4627 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1490 1495 1500 gaa ttc aaa aga ata tgg tca gaa tat gac cct gag gca aag gga 4672 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1505 1510 1515 agg ata aaa cac ctt gat gtg gtc act ctg ctt cga cgc atc cag 4717 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1520 1525 1530 cct ccc ctg ggg ttt ggg aag tta tgt cca cac agg gta gcg tgc 4762 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1535 1540 1545 aag aga tta gtt gcc atg aac atg cct ctc aac agt gac ggg aca 4807 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1550 1555 1560 gtc atg ttt aat gca acc ctg ttt gct ttg gtt cga acg gct ctt 4852 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1565 1570 1575 aag atc aag acc gaa ggg aac ctg gag caa gct aat gaa gaa ctt 4897 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1580 1585 1590 cgg gct gtg ata aag aaa att tgg aag aaa acc agc atg aaa tta 4942 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1595 1600 1605 ctt gac caa gtt gtc cct cca gct ggt gat gat gag gta acc gtg 4987 Leu Asp Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1610 1615 1620 ggg aag ttc tat gcc act ttc ctg ata cag gac tac ttt agg aaa 5032 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1625 1630 1635 ttc aag aaa cgg aaa gaa caa gga ctg gtg gga aag tac cct gcg 5077 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 1640 1645 1650 aag aac acc aca att gcc cta cag gcg gga tta agg aca ctg cat 5122 Lys Asn Thr Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1655 1660 1665 gac att ggg cca gaa atc cgg cgt gct ata tcg tgt gat ttg caa 5167 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1670 1675 1680 gat gac gag cct gag gaa aca aaa cga gaa gaa 
gaa gat gat gtg 5212 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1685 1690 1695 ttc aaa aga aat ggt gcc ctg ctt gga aac cat gtc aat cat gtt 5257 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1700 1705 1710 aat agt gat agg aga gat tcc ctt cag cag acc aat acc acc cac 5302 Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His 1715 1720 1725 cgt ccc ctg cat gtc caa agg cct tca att cca cct gca agt gat 5347 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1730 1735 1740 act gag aaa ccg ctg ttt cct cca gca gga aat tcg gtg tgt cat 5392 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1745 1750 1755 aac cat cat aac cat aat tcc ata gga aag caa gtt ccc acc tca 5437 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1760 1765 1770 aca aat gcc aat ctc aat aat gcc aat atg tcc aaa gct gcc cat 5482 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1775 1780 1785 gga aag cgg ccc agc att ggg aac ctt gag cat gtg tct gaa aat 5527 Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1790 1795 1800 ggg cat cat tct tcc cac aag cat gac cgg gag cct cag aga agg 5572 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1805 1810 1815 tcc agt gtg aaa aga acc cgc tat tat gaa act tac att agg tcc 5617 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1820 1825 1830 gac tca gga gat gaa cag ctc cca act att tgc cgg gaa gac cca 5662 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1835 1840 1845 gag ata cat ggc tat ttc agg gac ccc cac tgc ttg ggg gag cag 5707 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1850 1855 1860 gag tat ttc agt agt gag gaa tgc tac gag gat gac agc tcg ccc 5752 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro 1865 1870 1875 acc tgg agc agg caa aac tat ggc tac tac agc aga tac cca ggc 5797 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1880 1885 1890 aga aac atc gac tct gag agg ccc cga ggc tac cat cat ccc caa 5842 Arg Asn Ile Asp 
Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1895 1900 1905 gga ttc ttg gag gac gat gac tcg ccc gtt tgc tat gat tca cgg 5887 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1910 1915 1920 aga tct cca agg aga cgc cta cta cct ccc acc cca gca tcc cac 5932 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1925 1930 1935 cgg aga tcc tcc ttc aac ttt gag tgc ctg cgc cgg cag agc agc 5977 Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1940 1945 1950 cag gaa gag gtc ccg tcg tct ccc atc ttc ccc cat cgc acg gcc 6022 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1955 1960 1965 ctg cct ctg cat cta atg cag caa cag atc atg gca gtt gcc ggc 6067 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1970 1975 1980 cta gat tca agt aaa gcc cag aag tac tca ccg agt cac tcg acc 6112 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1985 1990 1995 cgg tcg tgg gcc acc cct cca gca acc cct ccc tac cgg gac tgg 6157 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp 2000 2005 2010 aca ccg tgc tac acc ccc ctg atc caa gtg gag cag tca gag gcc 6202 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 2015 2020 2025 ctg gac cag gtg aac ggc agc ctg ccg tcc ctg cac cgc agc tcc 6247 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 2030 2035 2040 tgg tac aca gac gag ccc gac atc tcc tac cgg act ttc aca cca 6292 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 2045 2050 2055 gcc agc ctg act gtc ccc agc agc ttc cgg aac aaa aac agc gac 6337 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 2060 2065 2070 aag cag agg agt gcg gac agc ttg gtg gag gca gtc ctg ata tcc 6382 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 2075 2080 2085 gaa ggc ttg gga cgc tat gca agg gac cca aaa ttt gtg tca gca 6427 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 2090 2095 2100 aca aaa cac gaa atc gct gat gcc tgt gac ctc acc atc gac gag 6472 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp 
Glu 2105 2110 2115 atg gag agt gca gcc agc acc ctg ctt aat ggg aac gtg cgt ccc 6517 Met Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 2120 2125 2130 cga gcc aac ggg gat gtg ggc ccc ctc tca cac cgg cag gac tat 6562 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 2135 2140 2145 gag cta cag gac ttt ggt cct ggc tac agc gac gaa gag cca gac 6607 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp 2150 2155 2160 cct ggg agg gat gag gag gac ctg gcg gat gaa atg ata tgc atc 6652 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 2165 2170 2175 acc acc ttg tag cccccagcga ggggcagact ggctctggcc tcaggtgggg 6704 Thr Thr Leu 2180 cgcaggagag ccaggggaaa agtgcctcat agttaggaaa gtttaggcac tagttgggag 6764 taatattcaa ttaattagac ttttgtataa gagatgtcat gcctcaagaa agccataaac 6824 ctggtaggaa caggtcccaa gcggttgagc ctggcagagt accatgcgct cggccccagc 6884 tgcaggaaac agcaggcccc gccctctcac agaggatggg tgaggaggcc agacctgccc 6944 tgccccattg tccagatggg cactgctgtg gagtctgctt ctcccatgta ccagggcacc 7004 aggcccaccc aactgaaggc atggcggcgg ggtgcagggg aaagttaaag gtgatgacga 7064 tcatcacacc tcgtgtcgtt acctcagcca tcggtctagc atatcagtca ctgggcccaa 7124 catatccatt tttaaaccct ttcccccaaa tacactgcgt cctggttcct gtttagctgt 7184 tctgaaata 7193 20 2181 PRT Homo sapiens 20 Met Met Met Met Met Met Met Lys Lys Met Gln His Gln Arg Gln Gln 1 5 10 15 Gln Ala Asp His Ala Asn Glu Ala Asn Tyr Ala Arg Gly Thr Arg Leu 20 25 30 Pro Leu Ser Gly Glu Gly Pro Thr Ser Gln Pro Asn Ser Ser Lys Gln 35 40 45 Thr Val Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys 50 55 60 Ala Ala Gln Thr Met Ser Thr Ser Ala Pro Pro Pro Val Gly Ser Leu 65 70 75 80 Ser Gln Arg Lys Arg Gln Gln Tyr Ala Lys Ser Lys Lys Gln Gly Asn 85 90 95 Ser Ser Asn Ser Arg Pro Ala Arg Ala Leu Phe Cys Leu Ser Leu Asn 100 105 110 Asn Pro Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe 115 120 125 Asp Ile Phe Ile Leu Leu Ala Ile Phe Ala Asn Cys Val Ala Leu Ala 130 135 140 Ile Tyr Ile Pro Phe Pro Glu Asp Asp Ser Asn Ser Thr Asn 
His Asn 145 150 155 160 Leu Glu Lys Val Glu Tyr Ala Phe Leu Ile Ile Phe Thr Val Glu Thr 165 170 175 Phe Leu Lys Ile Ile Ala Tyr Gly Leu Leu Leu His Pro Asn Ala Tyr 180 185 190 Val Arg Asn Gly Trp Asn Leu Leu Asp Phe Val Ile Val Ile Val Gly 195 200 205 Leu Phe Ser Val Ile Leu Glu Gln Leu Thr Lys Glu Thr Glu Gly Gly 210 215 220 Asn His Ser Ser Gly Lys Ser Gly Gly Phe Asp Val Lys Ala Leu Arg 225 230 235 240 Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser 245 250 255 Leu Gln Val Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu 260 265 270 His Ile Ala Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile 275 280 285 Gly Leu Glu Leu Phe Ile Gly Lys Met His Lys Thr Cys Phe Phe Ala 290 295 300 Asp Ser Asp Ile Val Ala Glu Glu Asp Pro Ala Pro Cys Ala Phe Ser 305 310 315 320 Gly Asn Gly Arg Gln Cys Thr Ala Asn Gly Thr Glu Cys Arg Ser Gly 325 330 335 Trp Val Gly Pro Asn Gly Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe 340 345 350 Ala Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp 355 360 365 Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val 370 375 380 Tyr Phe Val Ser Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu 385 390 395 400 Val Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala 405 410 415 Lys Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu 420 425 430 Glu Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile 435 440 445 Asp Pro Glu Asn Glu Glu Glu Gly Gly Glu Glu Gly Lys Arg Asn Thr 450 455 460 Ser Met Pro Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ser 465 470 475 480 Gly Glu Gly Glu Asn Arg Gly Cys Cys Gly Ser Leu Trp Cys Trp Trp 485 490 495 Arg Arg Arg Gly Ala Ala Lys Ala Gly Pro Ser Gly Cys Arg Arg Trp 500 505 510 Gly Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg Trp Arg Arg Trp 515 520 525 Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val Lys Ser Val Thr 530 535 540 Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn Thr Leu Thr Ile 545 550 555 560 Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr Gln Ile 
Gln Asp 565 570 575 Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys Glu Met Leu Val 580 585 590 Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn 595 600 605 Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr Glu Thr Ile Leu 610 615 620 Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser Val Phe Arg Cys 625 630 635 640 Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His Trp Thr Ser Leu 645 650 655 Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys Ser Ile Ala Ser 660 665 670 Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly 675 680 685 Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu Thr Gln Thr Lys 690 695 700 Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu Thr Val Phe Gln 705 710 715 720 Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr Asp Gly Ile Met 725 730 735 Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val Cys Ile Tyr Phe 740 745 750 Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu 755 760 765 Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser Leu Asn Thr Ala 770 775 780 Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys Ile Ala Arg Lys 785 790 795 800 Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu Val Asn Gln Ile 805 810 815 Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr Arg Glu Glu Asp 820 825 830 Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro Val Gly Glu Glu 835 840 845 Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro Ala Gly Pro Arg 850 855 860 Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys Ile Ala Pro Ile 865 870 875 880 Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr Asn Pro Ile Arg 885 890 895 Val Gly Cys His Lys Leu Ile Asn His His Ile Phe Thr Asn Leu Ile 900 905 910 Leu Val Phe Ile Met Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp Pro 915 920 925 Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly Tyr Phe Asp Tyr 930 935 940 Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu Lys Met Thr Thr 945 950 955 960 Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg Asn Tyr Phe Asn 965 970 975 Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val Ser Phe Gly 
Ile 980 985 990 Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg Val 995 1000 1005 Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 1010 1015 1020 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile 1025 1030 1035 Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 1040 1045 1050 Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 1055 1060 1065 Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys 1070 1075 1080 Asp Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln 1085 1090 1095 Asn Ser Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala 1100 1105 1110 Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr 1115 1120 1125 Lys Ala Ile Asp Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn 1130 1135 1140 His Arg Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile 1145 1150 1155 Val Ala Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val 1160 1165 1170 Thr Phe Gln Glu Gln Gly Glu Lys Glu Tyr Lys Asn Cys Glu Leu 1175 1180 1185 Asp Lys Asn Gln Arg Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg 1190 1195 1200 Pro Leu Arg Arg Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Lys Phe 1205 1210 1215 Trp Tyr Val Val Asn Ser Ser Pro Phe Glu Tyr Met Met Phe Val 1220 1225 1230 Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met Gln His Tyr Glu 1235 1240 1245 Gln Ser Lys Met Phe Asn Asp Ala Met Asp Ile Leu Asn Met Val 1250 1255 1260 Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val Ile Ala 1265 1270 1275 Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe Asp 1280 1285 1290 Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu 1295 1300 1305 Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr 1310 1315 1320 Pro Gly Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe 1325 1330 1335 Arg Leu Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly 1340 1345 1350 Glu Gly Ile Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln 1355 1360 1365 Ala Leu Pro Tyr Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile 1370 1375 1380 Tyr Ala Val Ile Gly Met Gln 
Met Phe Gly Lys Val Ala Met Arg 1385 1390 1395 Asp Asn Asn Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro 1400 1405 1410 Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp 1415 1420 1425 Gln Glu Ile Met Leu Ala Cys Leu Pro Gly Lys Leu Cys Asp Pro 1430 1435 1440 Glu Ser Asp Tyr Asn Pro Gly Glu Glu Tyr Thr Cys Gly Ser Asn 1445 1450 1455 Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe 1460 1465 1470 Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp 1475 1480 1485 Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 1490 1495 1500 Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala Lys Gly 1505 1510 1515 Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln 1520 1525 1530 Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 1535 1540 1545 Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr 1550 1555 1560 Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu 1565 1570 1575 Lys Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu 1580 1585 1590 Arg Ala Val Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu 1595 1600 1605 Leu Asp Gln Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val 1610 1615 1620 Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys 1625 1630 1635 Phe Lys Lys Arg Lys Glu Gln Gly Leu Val Gly Lys Tyr Pro Ala 1640 1645 1650 Lys Asn Thr Thr Ile Ala Leu Gln Ala Gly Leu Arg Thr Leu His 1655 1660 1665 Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Cys Asp Leu Gln 1670 1675 1680 Asp Asp Glu Pro Glu Glu Thr Lys Arg Glu Glu Glu Asp Asp Val 1685 1690 1695 Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn His Val Asn His Val 1700 1705 1710 Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr Asn Thr Thr His 1715 1720 1725 Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro Ala Ser Asp 1730 1735 1740 Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val Cys His 1745 1750 1755 Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr Ser 1760 1765 1770 Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 1775 1780 1785 
Gly Lys Arg Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn 1790 1795 1800 Gly His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg 1805 1810 1815 Ser Ser Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser 1820 1825 1830 Asp Ser Gly Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro 1835 1840 1845 Glu Ile His Gly Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln 1850 1855 1860 Glu Tyr Phe Ser Ser Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro 1865 1870 1875 Thr Trp Ser Arg Gln Asn Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly 1880 1885 1890 Arg Asn Ile Asp Ser Glu Arg Pro Arg Gly Tyr His His Pro Gln 1895 1900 1905 Gly Phe Leu Glu Asp Asp Asp Ser Pro Val Cys Tyr Asp Ser Arg 1910 1915 1920 Arg Ser Pro Arg Arg Arg Leu Leu Pro Pro Thr Pro Ala Ser His 1925 1930 1935 Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu Arg Arg Gln Ser Ser 1940 1945 1950 Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro His Arg Thr Ala 1955 1960 1965 Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala Val Ala Gly 1970 1975 1980 Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His Ser Thr 1985 1990 1995 Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp Trp 2000 2005 2010 Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala 2015 2020 2025 Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser 2030 2035 2040 Trp Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro 2045 2050 2055 Ala Ser Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp 2060 2065 2070 Lys Gln Arg Ser Ala Asp Ser Leu Val Glu Ala Val Leu Ile Ser 2075 2080 2085 Glu Gly Leu Gly Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala 2090 2095 2100 Thr Lys His Glu Ile Ala Asp Ala Cys Asp Leu Thr Ile Asp Glu 2105 2110 2115 Met Glu Ser Ala Ala Ser Thr Leu Leu Asn Gly Asn Val Arg Pro 2120 2125 2130 Arg Ala Asn Gly Asp Val Gly Pro Leu Ser His Arg Gln Asp Tyr 2135 2140 2145 Glu Leu Gln Asp Phe Gly Pro Gly Tyr Ser Asp Glu Glu Pro Asp 2150 2155 2160 Pro Gly Arg Asp Glu Glu Asp Leu Ala Asp Glu Met Ile Cys Ile 2165 2170 2175 Thr Thr Leu 2180 21 6083 DNA Oryctolagus 
cuniculus gene (1)..(6083) 21 ttccacctac atgttggcct ggacagcagg gagccgaggg gaggctaatt ttactgctgg 60 gagcagctag cataatcctc ccgcccccac cccgctggct cagcagggca ggcttcgccc 120 ggcaagctca gcggcccagt ccccaaggcg gggaacactg gggacgcagg gaagagaggg 180 ccgcggggtg ggggagcagc aggaagcgcc gtggccaggg aagcc atg gag cca tcc 237 Met Glu Pro Ser 1 tca ccc cag gat gag ggc ctg agg aag aaa cag ccc aag aag ccc ctg 285 Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro Lys Lys Pro Leu 5 10 15 20 ccc gag gtc ctg ccc agg ccg ccg cgg gct ctg ttc tgc ctg acc ctg 333 Pro Glu Val Leu Pro Arg Pro Pro Arg Ala Leu Phe Cys Leu Thr Leu 25 30 35 cag aac ccg ctg agg aag gcg tgc atc agc atc gtg gaa tgg aaa ccc 381 Gln Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val Glu Trp Lys Pro 40 45 50 ttc gag acc atc atc ctg ctc acc atc ttt gcc aac tgt gtg gcc ctg 429 Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn Cys Val Ala Leu 55 60 65 gcc gtg tac ctg ccc atg ccc gag gat gac aac aac tcc ctg aac ctg 477 Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn Ser Leu Asn Leu 70 75 80 ggc ctg gag aag ctg gag tac ttc ttc ctc acc gtc ttc tcc atc gaa 525 Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Thr Val Phe Ser Ile Glu 85 90 95 100 gcc gcc atg aag atc atc gcc tac ggc ttc ctg ttc cac cag gac gcc 573 Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe His Gln Asp Ala 105 110 115 tac ctg cgc agc ggc tgg aac gtg ctg gac ttc atc atc gtc ttc ctg 621 Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Ile Ile Val Phe Leu 120 125 130 ggg gtc ttc acg gcg att ctg gaa cag gtc aac gtc atc cag agc aac 669 Gly Val Phe Thr Ala Ile Leu Glu Gln Val Asn Val Ile Gln Ser Asn 135 140 145 acg gcc ccg atg agc agc aaa gga gcc ggc ctg gac gtc aag gcc ctg 717 Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp Val Lys Ala Leu 150 155 160 agg gcc ttc cgt gtg ctc aga ccc ctc cgg ctg gtg tcg ggg gtg cct 765 Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro 165 170 175 180 agt ttg cag gtg gtc ctc aac tcc atc ttc aag gcc atg ctc ccc ctg 813 Ser Leu Gln Val Val Leu Asn 
Ser Ile Phe Lys Ala Met Leu Pro Leu 185 190 195 ttc cac atc gcc ctg ctc gtc ctc ttc atg gtc atc atc tac gcc atc 861 Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile Ile Tyr Ala Ile 200 205 210 atc ggg ctg gag ctc ttc aag ggc aag atg cac aag acc tgc tac tac 909 Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys Thr Cys Tyr Tyr 215 220 225 atc ggg aca gac atc gtg gcc aca gtg gag aat gag aag ccc tcg ccc 957 Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu Lys Pro Ser Pro 230 235 240 tgc gct agg acg ggc tcg ggg cgc ccc tgc acc atc aac ggc agc gag 1005 Cys Ala Arg Thr Gly Ser Gly Arg Pro Cys Thr Ile Asn Gly Ser Glu 245 250 255 260 tgc cgg ggc ggc tgg ccg ggg ccc aac cac ggc atc acg cac ttc gac 1053 Cys Arg Gly Gly Trp Pro Gly Pro Asn His Gly Ile Thr His Phe Asp 265 270 275 aac ttc ggc ttc tcc atg ctc acc gtg tac cag tgc atc acc atg gag 1101 Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys Ile Thr Met Glu 280 285 290 ggc tgg aca gat gtc ctc tac tgg gtc aac gat gcc atc ggg aac gag 1149 Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala Ile Gly Asn Glu 295 300 305 tgg ccc tgg atc tac ttt gtc act ctc atc ctg ctg ggg tcc ttc ttc 1197 Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu Gly Ser Phe Phe 310 315 320 atc ctc aac ctg gtg ctg ggc gtc ctg agt ggg gaa ttc acc aag gag 1245 Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Thr Lys Glu 325 330 335 340 cgg gag aag gcc aag tcc agg gga acc ttc cag aag ctg cgg gag aag 1293 Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys Leu Arg Glu Lys 345 350 355 cag cag ctg gag gag gac ctt cgg ggc tac atg agc tgg atc acg cag 1341 Gln Gln Leu Glu Glu Asp Leu Arg Gly Tyr Met Ser Trp Ile Thr Gln 360 365 370 ggc gag gtc atg gac gtg gag gac ctg aga gaa gga aag ctg tcc ttg 1389 Gly Glu Val Met Asp Val Glu Asp Leu Arg Glu Gly Lys Leu Ser Leu 375 380 385 gaa gag gga ggc tcc gac acg gaa agc ctg tac gaa atc gag ggc ttg 1437 Glu Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu Ile Glu Gly Leu 390 395 400 aac aaa atc atc cag ttc atc cga cac tgg agg cag tgg aac cgt 
gtc 1485 Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln Trp Asn Arg Val 405 410 415 420 ttt cgc tgg aag tgc cat gac ctg gtg aag tcg aga gtc ttc tac tgg 1533 Phe Arg Trp Lys Cys His Asp Leu Val Lys Ser Arg Val Phe Tyr Trp 425 430 435 ctg gtc atc ctg atc gtg gcc ctc aac acc ctg tcc atc gcc tcg gag 1581 Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser Ile Ala Ser Glu 440 445 450 cac cac aac cag ccg ctc tgg ctg acc cac ttg caa gac atc gcc aat 1629 His His Asn Gln Pro Leu Trp Leu Thr His Leu Gln Asp Ile Ala Asn 455 460 465 cga gtg ctg ctg tca ctc ttc acc atc gag atg ctg ctg aag atg tac 1677 Arg Val Leu Leu Ser Leu Phe Thr Ile Glu Met Leu Leu Lys Met Tyr 470 475 480 ggg ctg ggc ctg cgc cag tac ttc atg tcc atc ttc aac cgc ttc gac 1725 Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe Asn Arg Phe Asp 485 490 495 500 tgc ttc gtg gtg tgc agc ggc atc ctg gag ctg ctg ctg gtg gag tcg 1773 Cys Phe Val Val Cys Ser Gly Ile Leu Glu Leu Leu Leu Val Glu Ser 505 510 515 ggc gcc atg acg ccg ctg ggc atc tcc gtg ttg cgc tgc atc cgc ctc 1821 Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg Cys Ile Arg Leu 520 525 530 ctg agg ctc ttc aag atc acc aag tac tgg acg tcg ctc agc aac ctg 1869 Leu Arg Leu Phe Lys Ile Thr Lys Tyr Trp Thr Ser Leu Ser Asn Leu 535 540 545 gtg gcc tcc ctg ctc aac tcc atc cgc tcc atc gcc tcg ctg ctg ctg 1917 Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala Ser Leu Leu Leu 550 555 560 ctg ctc ttc ctc ttc atc atc atc ttc gcc ctg ctg ggc atg cag ctc 1965 Leu Leu Phe Leu Phe Ile Ile Ile Phe Ala Leu Leu Gly Met Gln Leu 565 570 575 580 ttc ggg ggg cgg tac gac ttc gag gac acg gaa gtg cga cgc agc aac 2013 Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val Arg Arg Ser Asn 585 590 595 ttc gac aac ttc ccc cag gcc ctc atc agc gtc ttc cag gtg ctg acg 2061 Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe Gln Val Leu Thr 600 605 610 ggt gag gac tgg aac tcc gtg atg tac aac ggg atc atg gcc tac gga 2109 Gly Glu Asp Trp Asn Ser Val Met Tyr Asn Gly Ile Met Ala Tyr Gly 615 620 625 ggc ccg tcc tac 
ccg ggc gtt ctc gtg tgc atc tat ttc atc atc ctt 2157 Gly Pro Ser Tyr Pro Gly Val Leu Val Cys Ile Tyr Phe Ile Ile Leu 630 635 640 ttt gtc tgc ggc aac tat atc ctg ctg aat gtc ttc ctg gcc atc gcc 2205 Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu Ala Ile Ala 645 650 655 660 gtg gac aac ctg gcc gag gcg gag agc ctg act tcc gcg caa aag gcc 2253 Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser Ala Gln Lys Ala 665 670 675 aag gcc gag gag agg aaa cgc agg aag atg tcc agg ggt ctc cct gac 2301 Lys Ala Glu Glu Arg Lys Arg Arg Lys Met Ser Arg Gly Leu Pro Asp 680 685 690 aag aca gag gag gag aag tct gtg atg gcc aag aag ctg gag cag aag 2349 Lys Thr Glu Glu Glu Lys Ser Val Met Ala Lys Lys Leu Glu Gln Lys 695 700 705 ccc aag ggg gag ggc atc ccc acc act gcc aag ctc aag gtc gat gag 2397 Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu Lys Val Asp Glu 710 715 720 ttc gaa tct aac gtc aac gag gtg aag gac ccc tac cct tca gct gac 2445 Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr Pro Ser Ala Asp 725 730 735 740 ttc cca ggg gat gat gag gag gac gag cct gag atc cca gtg agc ccc 2493 Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile Pro Val Ser Pro 745 750 755 cga ccg cgc ccg ctg gcc gag ctg cag ctc aaa gag aag gca gtg ccc 2541 Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu Lys Ala Val Pro 760 765 770 atc ccg gaa gcc agc tcc ttc ttc atc ttc agt ccc acc aat aag gtc 2589 Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro Thr Asn Lys Val 775 780 785 cgt gtc ctg tgt cac cgc atc gtc aac gcc acc tgg ttc acc aac ttc 2637 Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp Phe Thr Asn Phe 790 795 800 atc ctg ctc ttc atc ctg ctc agc agt gct gcg ctg gcc gcc gag gac 2685 Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu Ala Ala Glu Asp 805 810 815 820 ccc atc cgg gcg gag tcc gtg agg aat cag atc ctt gga tat ttt gat 2733 Pro Ile Arg Ala Glu Ser Val Arg Asn Gln Ile Leu Gly Tyr Phe Asp 825 830 835 att gcc ttc acc tct gtc ttc act gtg gag att gtc ctc aag atg acg 2781 Ile Ala Phe Thr Ser Val Phe Thr Val Glu Ile Val 
Leu Lys Met Thr 840 845 850 acc tac ggc gcc ttc ctg cac aag ggc tcc ttc tgc cgc aac tac ttc 2829 Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys Arg Asn Tyr Phe 855 860 865 aac atc ctg gac ctg ctg gtg gtg gct gtg tct ctc atc tcc atg ggt 2877 Asn Ile Leu Asp Leu Leu Val Val Ala Val Ser Leu Ile Ser Met Gly 870 875 880 ctc gag tcc agc acc atc tcc gtg gta aag atc ctg aga gtg cta agg 2925 Leu Glu Ser Ser Thr Ile Ser Val Val Lys Ile Leu Arg Val Leu Arg 885 890 895 900 gtg ctc cgg ccc ctg cga gcc atc aac aga gcc aaa ggg ttg aag cac 2973 Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu Lys His 905 910 915 gtg gtc cag tgc gtg ttc gtg gcc atc cgc acc atc ggg aac atc gtc 3021 Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn Ile Val 920 925 930 ctg gtc acc acg ctc ctg cag ttc atg ttc gcc tgc att ggt gtc cag 3069 Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly Val Gln 935 940 945 ctc ttc aag ggc aag ttc ttc agc tgc aac gac cta tcc aag atg aca 3117 Leu Phe Lys Gly Lys Phe Phe Ser Cys Asn Asp Leu Ser Lys Met Thr 950 955 960 gaa gag gag tgc agg ggc tac tac tat gtg tac aag gac ggg gac ccc 3165 Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys Asp Gly Asp Pro 965 970 975 980 acg cag atg gag ctg cgc ccc cgc cag tgg ata cac aat gac ttc cac 3213 Thr Gln Met Glu Leu Arg Pro Arg Gln Trp Ile His Asn Asp Phe His 985 990 995 ttt gac aac gtg ctg tcg gcc atg atg tcg ctc ttc acg gtg tcc 3258 Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe Thr Val Ser 1000 1005 1010 acc ttc gag gga tgg ccc cag ctg ctg tac agg gcc ata gac tcc 3303 Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Arg Ala Ile Asp Ser 1015 1020 1025 aac gag gag gac atg ggc ccc gtt tac aac aac cga gtg gag atg 3348 Asn Glu Glu Asp Met Gly Pro Val Tyr Asn Asn Arg Val Glu Met 1030 1035 1040 gcc atc ttc ttc atc atc tac atc atc ctc att gcc ttc ttc atg 3393 Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala Phe Phe Met 1045 1050 1055 atg aac atc ttt gtg ggc ttt gtc atc gtc acc ttc cag gag cag 3438 Met Asn Ile Phe Val Gly Phe Val 
Ile Val Thr Phe Gln Glu Gln 1060 1065 1070 ggg gag aca gag tac aag aac tgc gag ctg gac aag aac cag cgc 3483 Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg 1075 1080 1085 cag tgt gtg cag tat gcc ctg aag gcc cgc cca ctt cgg tgc tac 3528 Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu Arg Cys Tyr 1090 1095 1100 atc ccc aag aac cca tac cag tac cag gtg tgg tac gtc gtc acc 3573 Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr Val Val Thr 1105 1110 1115 tcc tcc tac ttt gaa tac ctg atg ttc gcc ctc atc atg ctc aac 3618 Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile Met Leu Asn 1120 1125 1130 acc atc tgc ctg ggc atg cag cac tac cac cag tcg gag gag atg 3663 Thr Ile Cys Leu Gly Met Gln His Tyr His Gln Ser Glu Glu Met 1135 1140 1145 aac cac atc tcg gac atc ctc aac gtg gcc ttc acc atc atc ttc 3708 Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr Ile Ile Phe 1150 1155 1160 aca ctg gag atg atc ctc aag ctc ttg gcg ttc aag gcc agg ggc 3753 Thr Leu Glu Met Ile Leu Lys Leu Leu Ala Phe Lys Ala Arg Gly 1165 1170 1175 tat ttc gga gac ccc tgg aat gtg ttc gac ttc ctg atc gtc atc 3798 Tyr Phe Gly Asp Pro Trp Asn Val Phe Asp Phe Leu Ile Val Ile 1180 1185 1190 ggc agc atc att gac gtc atc ctc agc gag atc gac act ttc ctg 3843 Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp Thr Phe Leu 1195 1200 1205 gcc tcc agc ggg gga ctg tat tgc ctg ggt ggc ggc tgc ggg aac 3888 Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly Cys Gly Asn 1210 1215 1220 gtt gac cca gac gag agc gcc cgc atc tcc agt gcc ttc ttc cgc 3933 Val Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala Phe Phe Arg 1225 1230 1235 ctg ttc cgg gtc atg agg ctg atc aag ctg ctg agt cgg gcc gag 3978 Leu Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser Arg Ala Glu 1240 1245 1250 ggc gtg cgc acg ctg ctg tgg acg ttc atc aag tcc ttc cag gcc 4023 Gly Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala 1255 1260 1265 ctg ccc tac gtg gcc ctg ctc atc gtc atg ctg ttc ttc atc tac 4068 Leu Pro Tyr Val Ala Leu Leu Ile Val Met Leu Phe Phe Ile Tyr 1270 1275 
1280 gcc gtc atc ggc atg cag atg ttt gga aag atc gcc ctg gtg gac 4113 Ala Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala Leu Val Asp 1285 1290 1295 ggg acc cag atc aac cgc aac aac aac ttc cag acc ttc ccg cag 4158 Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln 1300 1305 1310 gcc gtg ctg ctg ctc ttc agg tgt gcg aca ggg gag gcg tgg caa 4203 Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu Ala Trp Gln 1315 1320 1325 gag atc ctg ctg gcc tgc agc tac ggg aag ttg tgc gac cca gag 4248 Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys Asp Pro Glu 1330 1335 1340 tca gac tac gcc ccg ggc gag gag tac acg tgt ggc acc aac ttc 4293 Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly Thr Asn Phe 1345 1350 1355 gcc tac tac tac ttc atc agc ttc tac atg ctc tgc gcc ttc ctg 4338 Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys Ala Phe Leu 1360 1365 1370 atc atc aac ctc ttc gtg gct gtc atc atg gac aac ttt gac tac 4383 Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Asp Tyr 1375 1380 1385 ctg aca cgc gac tgg tcc atc ctg ggc cct cac cac ctg gac gag 4428 Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp Glu 1390 1395 1400 ttc aag gct atc tgg gca gag tat gac cca gag gcc aag ggg cga 4473 Phe Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg 1405 1410 1415 atc aag cac ctg gac gtg gtg acc ctg ctg aga agg atc cag ccc 4518 Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro 1420 1425 1430 cct ctg ggc ttc ggg aag ttc tgt cca cac cgg gtg gcc tgt aag 4563 Pro Leu Gly Phe Gly Lys Phe Cys Pro His Arg Val Ala Cys Lys 1435 1440 1445 cgc ctg gtg ggc atg aac atg ccc ctg aac agt gac ggc acg gtc 4608 Arg Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val 1450 1455 1460 acc ttc aat gcc acg ctc ttt gcc ctg gtg cgc acg gcc ctc aag 4653 Thr Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys 1465 1470 1475 atc aag aca gaa ggt aac ttt gag cag gcc aac gag gag ctg agg 4698 Ile Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu Glu Leu Arg 1480 1485 1490 gcc atc atc aag aag atc tgg aag 
aga acc agc atg aag ctg ctg 4743 Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met Lys Leu Leu 1495 1500 1505 gac cag gtc atc cct ccc ata gga gat gac gag gtg acc gtg ggg 4788 Asp Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val Thr Val Gly 1510 1515 1520 aag ttc tac gcc aca ttc ctc atc cag gag cac ttc cgg aag ttc 4833 Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe Arg Lys Phe 1525 1530 1535 atg aag cgc cag gag gaa tat tat ggg tat cgg ccc aag aag gac 4878 Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro Lys Lys Asp 1540 1545 1550 acc gtg cag atc cag gct ggg ctg cgg acc ata gag gag gag gcg 4923 Thr Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu Glu Glu Ala 1555 1560 1565 gcc cct gag atc cgc cgc acc atc tca gga gac ctg acc gcc gag 4968 Ala Pro Glu Ile Arg Arg Thr Ile Ser Gly Asp Leu Thr Ala Glu 1570 1575 1580 gag gag ctg gag aga gcc atg gtg gag gct gcg atg gag gag agg 5013 Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met Glu Glu Arg 1585 1590 1595 atc ttc cgg agg acg gga ggc ctg ttt ggc cag gtg gac acc ttc 5058 Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val Asp Thr Phe 1600 1605 1610 ctg gaa agg acc aac tcc ctg ccc ccg gtg atg gcc aac caa aga 5103 Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala Asn Gln Arg 1615 1620 1625 ccg ctc cag ttt gct gag ata gaa atg gaa gag ctt gag tcg cct 5148 Pro Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Leu Glu Ser Pro 1630 1635 1640 gtc ttc ttg gag gac ttc cct caa gat gca aga acc aac cct ctc 5193 Val Phe Leu Glu Asp Phe Pro Gln Asp Ala Arg Thr Asn Pro Leu 1645 1650 1655 gct cgt gcc aat acc aac aac gcc aat gcc aat gtt gcc tat ggc 5238 Ala Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val Ala Tyr Gly 1660 1665 1670 aac agc aac cat agc aac aac cag atg ttt tcc agc gtc cac tgt 5283 Asn Ser Asn His Ser Asn Asn Gln Met Phe Ser Ser Val His Cys 1675 1680 1685 gaa agg gag ttc ccg gga gag gcg gag aca ccg gct gcc gga cga 5328 Glu Arg Glu Phe Pro Gly Glu Ala Glu Thr Pro Ala Ala Gly Arg 1690 1695 1700 gga gcc ctc agc cac tcc cac agg gcc ctg gga cct cac agc aag 5373 Gly 
Ala Leu Ser His Ser His Arg Ala Leu Gly Pro His Ser Lys 1705 1710 1715 ccc tgt gct gga aaa ctg aat ggg cag ctg gtc cag ccg ggg atg 5418 Pro Cys Ala Gly Lys Leu Asn Gly Gln Leu Val Gln Pro Gly Met 1720 1725 1730 ccc atc aac cag gca cct cct gcc ccc tgc cag cag cct agc acg 5463 Pro Ile Asn Gln Ala Pro Pro Ala Pro Cys Gln Gln Pro Ser Thr 1735 1740 1745 gat ccc cca gag cgc ggg cag agg agg acc tcc ctg aca ggg tct 5508 Asp Pro Pro Glu Arg Gly Gln Arg Arg Thr Ser Leu Thr Gly Ser 1750 1755 1760 ctg caa gac gaa gca ccc cag agg agg agc tcc gag ggg agc acc 5553 Leu Gln Asp Glu Ala Pro Gln Arg Arg Ser Ser Glu Gly Ser Thr 1765 1770 1775 ccc agg cgc ccg gct cct gct aca gct ctg ctg atc caa gag gct 5598 Pro Arg Arg Pro Ala Pro Ala Thr Ala Leu Leu Ile Gln Glu Ala 1780 1785 1790 ctg gtt cga ggg ggc ctg gac acc ttg gca gct gat gct ggc ttc 5643 Leu Val Arg Gly Gly Leu Asp Thr Leu Ala Ala Asp Ala Gly Phe 1795 1800 1805 gtc acg gca aca agc cag gcc ctg gca gac gcc tgt cag atg gaa 5688 Val Thr Ala Thr Ser Gln Ala Leu Ala Asp Ala Cys Gln Met Glu 1810 1815 1820 ccg gag gaa gta gag gtc gca gcc aca gag cta ctg aaa gcg cga 5733 Pro Glu Glu Val Glu Val Ala Ala Thr Glu Leu Leu Lys Ala Arg 1825 1830 1835 gag tct gtc cag ggc atg gcc agt gtc ccg gga agc ctg agc cgc 5778 Glu Ser Val Gln Gly Met Ala Ser Val Pro Gly Ser Leu Ser Arg 1840 1845 1850 agg tcc tcc ctg ggc agc ctt gac cag gtc cag ggc tcc cag gaa 5823 Arg Ser Ser Leu Gly Ser Leu Asp Gln Val Gln Gly Ser Gln Glu 1855 1860 1865 acc ctt att cct ccc agg ccg tga tggctgtggt gtccacatga ccaaggcgag 5877 Thr Leu Ile Pro Pro Arg Pro 1870 agggacagtg cgtgcagaag ctcagccctg catggcagcc tccctctgtc tcagccctcc 5937 tgctgagctg gggcggtctg gaaccgcacc aggaagccag gagcctcccc tggccagcaa 5997 gaggcatgat tctaaagcca tccagaaagg cctggtcagt gccactcccc agcaggacat 6057 taaagtctct aggtctgtgg cactgg 6083 22 1873 PRT Oryctolagus cuniculus 22 Met Glu Pro Ser Ser Pro Gln Asp Glu Gly Leu Arg Lys Lys Gln Pro 1 5 10 15 Lys Lys Pro Leu Pro Glu Val Leu Pro Arg Pro Pro Arg Ala Leu Phe 20 
25 30 Cys Leu Thr Leu Gln Asn Pro Leu Arg Lys Ala Cys Ile Ser Ile Val 35 40 45 Glu Trp Lys Pro Phe Glu Thr Ile Ile Leu Leu Thr Ile Phe Ala Asn 50 55 60 Cys Val Ala Leu Ala Val Tyr Leu Pro Met Pro Glu Asp Asp Asn Asn 65 70 75 80 Ser Leu Asn Leu Gly Leu Glu Lys Leu Glu Tyr Phe Phe Leu Thr Val 85 90 95 Phe Ser Ile Glu Ala Ala Met Lys Ile Ile Ala Tyr Gly Phe Leu Phe 100 105 110 His Gln Asp Ala Tyr Leu Arg Ser Gly Trp Asn Val Leu Asp Phe Ile 115 120 125 Ile Val Phe Leu Gly Val Phe Thr Ala Ile Leu Glu Gln Val Asn Val 130 135 140 Ile Gln Ser Asn Thr Ala Pro Met Ser Ser Lys Gly Ala Gly Leu Asp 145 150 155 160 Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu Arg Leu Val 165 170 175 Ser Gly Val Pro Ser Leu Gln Val Val Leu Asn Ser Ile Phe Lys Ala 180 185 190 Met Leu Pro Leu Phe His Ile Ala Leu Leu Val Leu Phe Met Val Ile 195 200 205 Ile Tyr Ala Ile Ile Gly Leu Glu Leu Phe Lys Gly Lys Met His Lys 210 215 220 Thr Cys Tyr Tyr Ile Gly Thr Asp Ile Val Ala Thr Val Glu Asn Glu 225 230 235 240 Lys Pro Ser Pro Cys Ala Arg Thr Gly Ser Gly Arg Pro Cys Thr Ile 245 250 255 Asn Gly Ser Glu Cys Arg Gly Gly Trp Pro Gly Pro Asn His Gly Ile 260 265 270 Thr His Phe Asp Asn Phe Gly Phe Ser Met Leu Thr Val Tyr Gln Cys 275 280 285 Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Val Asn Asp Ala 290 295 300 Ile Gly Asn Glu Trp Pro Trp Ile Tyr Phe Val Thr Leu Ile Leu Leu 305 310 315 320 Gly Ser Phe Phe Ile Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu 325 330 335 Phe Thr Lys Glu Arg Glu Lys Ala Lys Ser Arg Gly Thr Phe Gln Lys 340 345 350 Leu Arg Glu Lys Gln Gln Leu Glu Glu Asp Leu Arg Gly Tyr Met Ser 355 360 365 Trp Ile Thr Gln Gly Glu Val Met Asp Val Glu Asp Leu Arg Glu Gly 370 375 380 Lys Leu Ser Leu Glu Glu Gly Gly Ser Asp Thr Glu Ser Leu Tyr Glu 385 390 395 400 Ile Glu Gly Leu Asn Lys Ile Ile Gln Phe Ile Arg His Trp Arg Gln 405 410 415 Trp Asn Arg Val Phe Arg Trp Lys Cys His Asp Leu Val Lys Ser Arg 420 425 430 Val Phe Tyr Trp Leu Val Ile Leu Ile Val Ala Leu Asn Thr Leu Ser 435 440 445 Ile Ala 
Ser Glu His His Asn Gln Pro Leu Trp Leu Thr His Leu Gln 450 455 460 Asp Ile Ala Asn Arg Val Leu Leu Ser Leu Phe Thr Ile Glu Met Leu 465 470 475 480 Leu Lys Met Tyr Gly Leu Gly Leu Arg Gln Tyr Phe Met Ser Ile Phe 485 490 495 Asn Arg Phe Asp Cys Phe Val Val Cys Ser Gly Ile Leu Glu Leu Leu 500 505 510 Leu Val Glu Ser Gly Ala Met Thr Pro Leu Gly Ile Ser Val Leu Arg 515 520 525 Cys Ile Arg Leu Leu Arg Leu Phe Lys Ile Thr Lys Tyr Trp Thr Ser 530 535 540 Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Ile Arg Ser Ile Ala 545 550 555 560 Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe Ala Leu Leu 565 570 575 Gly Met Gln Leu Phe Gly Gly Arg Tyr Asp Phe Glu Asp Thr Glu Val 580 585 590 Arg Arg Ser Asn Phe Asp Asn Phe Pro Gln Ala Leu Ile Ser Val Phe 595 600 605 Gln Val Leu Thr Gly Glu Asp Trp Asn Ser Val Met Tyr Asn Gly Ile 610 615 620 Met Ala Tyr Gly Gly Pro Ser Tyr Pro Gly Val Leu Val Cys Ile Tyr 625 630 635 640 Phe Ile Ile Leu Phe Val Cys Gly Asn Tyr Ile Leu Leu Asn Val Phe 645 650 655 Leu Ala Ile Ala Val Asp Asn Leu Ala Glu Ala Glu Ser Leu Thr Ser 660 665 670 Ala Gln Lys Ala Lys Ala Glu Glu Arg Lys Arg Arg Lys Met Ser Arg 675 680 685 Gly Leu Pro Asp Lys Thr Glu Glu Glu Lys Ser Val Met Ala Lys Lys 690 695 700 Leu Glu Gln Lys Pro Lys Gly Glu Gly Ile Pro Thr Thr Ala Lys Leu 705 710 715 720 Lys Val Asp Glu Phe Glu Ser Asn Val Asn Glu Val Lys Asp Pro Tyr 725 730 735 Pro Ser Ala Asp Phe Pro Gly Asp Asp Glu Glu Asp Glu Pro Glu Ile 740 745 750 Pro Val Ser Pro Arg Pro Arg Pro Leu Ala Glu Leu Gln Leu Lys Glu 755 760 765 Lys Ala Val Pro Ile Pro Glu Ala Ser Ser Phe Phe Ile Phe Ser Pro 770 775 780 Thr Asn Lys Val Arg Val Leu Cys His Arg Ile Val Asn Ala Thr Trp 785 790 795 800 Phe Thr Asn Phe Ile Leu Leu Phe Ile Leu Leu Ser Ser Ala Ala Leu 805 810 815 Ala Ala Glu Asp Pro Ile Arg Ala Glu Ser Val Arg Asn Gln Ile Leu 820 825 830 Gly Tyr Phe Asp Ile Ala Phe Thr Ser Val Phe Thr Val Glu Ile Val 835 840 845 Leu Lys Met Thr Thr Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys 850 855 860 Arg Asn Tyr 
Phe Asn Ile Leu Asp Leu Leu Val Val Ala Val Ser Leu 865 870 875 880 Ile Ser Met Gly Leu Glu Ser Ser Thr Ile Ser Val Val Lys Ile Leu 885 890 895 Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys 900 905 910 Gly Leu Lys His Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile 915 920 925 Gly Asn Ile Val Leu Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys 930 935 940 Ile Gly Val Gln Leu Phe Lys Gly Lys Phe Phe Ser Cys Asn Asp Leu 945 950 955 960 Ser Lys Met Thr Glu Glu Glu Cys Arg Gly Tyr Tyr Tyr Val Tyr Lys 965 970 975 Asp Gly Asp Pro Thr Gln Met Glu Leu Arg Pro Arg Gln Trp Ile His 980 985 990 Asn Asp Phe His Phe Asp Asn Val Leu Ser Ala Met Met Ser Leu Phe 995 1000 1005 Thr Val Ser Thr Phe Glu Gly Trp Pro Gln Leu Leu Tyr Arg Ala 1010 1015 1020 Ile Asp Ser Asn Glu Glu Asp Met Gly Pro Val Tyr Asn Asn Arg 1025 1030 1035 Val Glu Met Ala Ile Phe Phe Ile Ile Tyr Ile Ile Leu Ile Ala 1040 1045 1050 Phe Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe 1055 1060 1065 Gln Glu Gln Gly Glu Thr Glu Tyr Lys Asn Cys Glu Leu Asp Lys 1070 1075 1080 Asn Gln Arg Gln Cys Val Gln Tyr Ala Leu Lys Ala Arg Pro Leu 1085 1090 1095 Arg Cys Tyr Ile Pro Lys Asn Pro Tyr Gln Tyr Gln Val Trp Tyr 1100 1105 1110 Val Val Thr Ser Ser Tyr Phe Glu Tyr Leu Met Phe Ala Leu Ile 1115 1120 1125 Met Leu Asn Thr Ile Cys Leu Gly Met Gln His Tyr His Gln Ser 1130 1135 1140 Glu Glu Met Asn His Ile Ser Asp Ile Leu Asn Val Ala Phe Thr 1145 1150 1155 Ile Ile Phe Thr Leu Glu Met Ile Leu Lys Leu Leu Ala Phe Lys 1160 1165 1170 Ala Arg Gly Tyr Phe Gly Asp Pro Trp Asn Val Phe Asp Phe Leu 1175 1180 1185 Ile Val Ile Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Ile Asp 1190 1195 1200 Thr Phe Leu Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly 1205 1210 1215 Cys Gly Asn Val Asp Pro Asp Glu Ser Ala Arg Ile Ser Ser Ala 1220 1225 1230 Phe Phe Arg Leu Phe Arg Val Met Arg Leu Ile Lys Leu Leu Ser 1235 1240 1245 Arg Ala Glu Gly Val Arg Thr Leu Leu Trp Thr Phe Ile Lys Ser 1250 1255 1260 Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu 
Ile Val Met Leu Phe 1265 1270 1275 Phe Ile Tyr Ala Val Ile Gly Met Gln Met Phe Gly Lys Ile Ala 1280 1285 1290 Leu Val Asp Gly Thr Gln Ile Asn Arg Asn Asn Asn Phe Gln Thr 1295 1300 1305 Phe Pro Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly Glu 1310 1315 1320 Ala Trp Gln Glu Ile Leu Leu Ala Cys Ser Tyr Gly Lys Leu Cys 1325 1330 1335 Asp Pro Glu Ser Asp Tyr Ala Pro Gly Glu Glu Tyr Thr Cys Gly 1340 1345 1350 Thr Asn Phe Ala Tyr Tyr Tyr Phe Ile Ser Phe Tyr Met Leu Cys 1355 1360 1365 Ala Phe Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn 1370 1375 1380 Phe Asp Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His 1385 1390 1395 Leu Asp Glu Phe Lys Ala Ile Trp Ala Glu Tyr Asp Pro Glu Ala 1400 1405 1410 Lys Gly Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg 1415 1420 1425 Ile Gln Pro Pro Leu Gly Phe Gly Lys Phe Cys Pro His Arg Val 1430 1435 1440 Ala Cys Lys Arg Leu Val Gly Met Asn Met Pro Leu Asn Ser Asp 1445 1450 1455 Gly Thr Val Thr Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr 1460 1465 1470 Ala Leu Lys Ile Lys Thr Glu Gly Asn Phe Glu Gln Ala Asn Glu 1475 1480 1485 Glu Leu Arg Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met 1490 1495 1500 Lys Leu Leu Asp Gln Val Ile Pro Pro Ile Gly Asp Asp Glu Val 1505 1510 1515 Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu Ile Gln Glu His Phe 1520 1525 1530 Arg Lys Phe Met Lys Arg Gln Glu Glu Tyr Tyr Gly Tyr Arg Pro 1535 1540 1545 Lys Lys Asp Thr Val Gln Ile Gln Ala Gly Leu Arg Thr Ile Glu 1550 1555 1560 Glu Glu Ala Ala Pro Glu Ile Arg Arg Thr Ile Ser Gly Asp Leu 1565 1570 1575 Thr Ala Glu Glu Glu Leu Glu Arg Ala Met Val Glu Ala Ala Met 1580 1585 1590 Glu Glu Arg Ile Phe Arg Arg Thr Gly Gly Leu Phe Gly Gln Val 1595 1600 1605 Asp Thr Phe Leu Glu Arg Thr Asn Ser Leu Pro Pro Val Met Ala 1610 1615 1620 Asn Gln Arg Pro Leu Gln Phe Ala Glu Ile Glu Met Glu Glu Leu 1625 1630 1635 Glu Ser Pro Val Phe Leu Glu Asp Phe Pro Gln Asp Ala Arg Thr 1640 1645 1650 Asn Pro Leu Ala Arg Ala Asn Thr Asn Asn Ala Asn Ala Asn Val 1655 1660 1665 Ala Tyr Gly 
Asn Ser Asn His Ser Asn Asn Gln Met Phe Ser Ser 1670 1675 1680 Val His Cys Glu Arg Glu Phe Pro Gly Glu Ala Glu Thr Pro Ala 1685 1690 1695 Ala Gly Arg Gly Ala Leu Ser His Ser His Arg Ala Leu Gly Pro 1700 1705 1710 His Ser Lys Pro Cys Ala Gly Lys Leu Asn Gly Gln Leu Val Gln 1715 1720 1725 Pro Gly Met Pro Ile Asn Gln Ala Pro Pro Ala Pro Cys Gln Gln 1730 1735 1740 Pro Ser Thr Asp Pro Pro Glu Arg Gly Gln Arg Arg Thr Ser Leu 1745 1750 1755 Thr Gly Ser Leu Gln Asp Glu Ala Pro Gln Arg Arg Ser Ser Glu 1760 1765 1770 Gly Ser Thr Pro Arg Arg Pro Ala Pro Ala Thr Ala Leu Leu Ile 1775 1780 1785 Gln Glu Ala Leu Val Arg Gly Gly Leu Asp Thr Leu Ala Ala Asp 1790 1795 1800 Ala Gly Phe Val Thr Ala Thr Ser Gln Ala Leu Ala Asp Ala Cys 1805 1810 1815 Gln Met Glu Pro Glu Glu Val Glu Val Ala Ala Thr Glu Leu Leu 1820 1825 1830 Lys Ala Arg Glu Ser Val Gln Gly Met Ala Ser Val Pro Gly Ser 1835 1840 1845 Leu Ser Arg Arg Ser Ser Leu Gly Ser Leu Asp Gln Val Gln Gly 1850 1855 1860 Ser Gln Glu Thr Leu Ile Pro Pro Arg Pro 1865 1870 23 57 DNA Homo sapiens partial_cDNA (1)..(57) 23 actttcctgg cctccagcgg gggactgtat tgcctgggtg gaggctgcgg gaacgtt 57 24 19 PRT Homo sapiens DOMAIN (1)..(19) 24 Thr Phe Leu Ala Ser Ser Gly Gly Leu Tyr Cys Leu Gly Gly Gly Cys 1 5 10 15 Gly Asn Val 25 45 DNA Homo sapiens partial_cDNA (1)..(45) 25 ccaactgaaa gtgaaaatgt ccctgtccca actgctacac ctggg 45 26 15 PRT Homo sapiens DOMAIN (1)..(15) 26 Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr Pro Gly 1 5 10 15 27 20 PRT Homo sapiens 27 Asn Glu Glu Leu Arg Ala Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser 1 5 10 15 Met Lys Leu Leu 20 28 35 DNA Homo sapiens 28 ggccctcagc gaagcggaca actctgaaga gagca 35 29 35 DNA Homo sapiens 29 atcctcagtg agatcgacga cccagatgag agtgc 35 

What is claimed is:
 1. An isolated and purified platelet voltage dependent calcium channel (VDCC) α₁ subunit polypeptide.
 2. The isolated and purified platelet VDCC α₁ subunit polypeptide of claim 1, wherein the polypeptide comprises: (a) a polypeptide encoded by a nucleic acid molecule having the nucleotide sequence set forth as any of SEQ ID NOs:1, 3, 5-8, 28, or 29; (b) a polypeptide encoded by a nucleic acid molecule that is substantially identical to any of SEQ ID NOs:1, 3, 5-8, 28, or 29; (c) a polypeptide having the amino acid sequence set forth as SEQ ID NO:2 or 4; (d) a polypeptide that is a biological equivalent of the polypeptide of SEQ ID NO:2 or 4; or (e) a polypeptide which is immunologically cross-reactive with an antibody that shows specific binding with a polypeptide of SEQ ID NO:2 or 4.
 3. The polypeptide of claim 1, wherein the platelet VDCC α₁ subunit polypeptide comprises a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide.
 4. The polypeptide of claim 1, modified to be in detectably labeled form.
 5. An isolated and purified antibody capable of specifically binding to a polypeptide of claim 1.
 6. The antibody of claim 5, wherein the antibody is capable of modulating the biological activity of the polypeptide to which it specifically binds.
 7. A hybridoma cell line which produces an antibody of claim 5.
 8. An isolated and purified nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide of claim 1.
 9. The nucleic acid molecule of claim 8, wherein the encoded platelet VDCC α₁ subunit polypeptide comprises a platelet VDCC α₁S subunit polypeptide or a platelet VDCC α₁D subunit polypeptide.
 10. The nucleic acid molecule of claim 8, wherein the encoded polypeptide comprises: (a) a polypeptide having the amino acid sequence set forth as SEQ ID NO:2 or 4; (b) a polypeptide that is a biological equivalent of the polypeptide of SEQ ID NO:2 or 4; or (c) a polypeptide which is immunologically cross-reactive with an antibody that shows specific binding with a polypeptide of SEQ ID NO:2 or 4.
 11. The nucleic acid molecule of claim 8, comprising: (a) a nucleic acid molecule having the nucleotide sequence set forth as any of SEQ ID NOs:1, 3, 5-8, 28, or 29; (b) a nucleic acid molecule that is substantially identical to any of SEQ ID NOs:1, 3, 5-8, 28, or 29;
 12. The isolated nucleic acid molecule of claim 8, comprising a 20 nucleotide sequence that is identical to a contiguous 20 nucleotide sequence of SEQ ID NOs:28 or 29.
 13. The nucleic acid molecule of claim 8, further defined as a DNA segment.
 14. The nucleic acid molecule of claim 13, further defined as positioned under the control of a promoter.
 15. The nucleic acid molecule of claim 14, wherein said DNA segment and promoter are operationally inserted into a recombinant vector.
 16. A recombinant host cell comprising the nucleic acid molecule of claim 8.
 17. The recombinant host cell of claim 16, wherein the cell further comprises a platelet or a megakaryocyte.
 18. A method of producing an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide, the method comprising: (a) transfecting a recombinant host cell with a nucleic acid molecule that encodes a platelet VDCC α₁ subunit polypeptide of claim 1; (b) culturing the host cell under conditions sufficient for expression of the polypeptide; (c) recovering the polypeptide; and (d) preparing an antibody to the polypeptide, wherein the antibody specifically binds the polypeptide.
 19. The method of claim 18, wherein the polypeptide comprises a polypeptide as set forth as SEQ ID NO:2 or 4.
 20. The method of claim 18, wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in any of SEQ ID NOs:1, 3, 5-8, 28, or 29.
 21. A method of detecting a platelet VDCC α₁ subunit polypeptide, the method comprising immunoreacting the polypeptide with an antibody prepared according to the method of claim 18 to form an antibody-polypeptide conjugate; and detecting the conjugate.
 22. An assay kit for detecting the presence of a platelet VDCC α₁ subunit polypeptide in a biological sample, the kit comprising a first antibody that specifically binds a polypeptide of claim 1.
 23. The assay kit of claim 22, further comprising a second container containing a second antibody that immunoreacts with the first antibody.
 24. The assay kit of claim 23, wherein the first antibody and the second antibody comprise monoclonal antibodies.
 25. The assay kit of claim 23, wherein the first antibody is affixed to a solid support.
 26. The assay kit of claim 23, wherein the first and second antibodies each comprise an indicator.
 27. The assay kit of claim 26, wherein the indicator is a radioactive label or an enzyme.
 28. An assay kit for detecting the presence, in a biological sample, of an antibody that specifically binds a platelet VDCC α₁ subunit polypeptide, the kit comprising a polypeptide of claim 1 that specifically binds the antibody, wherein the polypeptide is present in an amount sufficient to perform at least one assay.
 29. A method of detecting a nucleic acid molecule that encodes a platelet VDCC α₁ subunit polypeptide in a biological sample containing nucleic acid material, the method comprising: (a) hybridizing the nucleic acid molecule of claim 8 under stringent hybridization conditions to the nucleic acid material of the biological sample, thereby forming a hybridization duplex; and (b) detecting the hybridization duplex, whereby a platelet VDCC α₁ subunit polypeptide is detected.
 30. A method to determine the presence or absence of a mutation conferring altered VDCC α₁ subunit activity in a platelet, said method comprising the step of analyzing a nucleic acid or protein sample for the presence of a mutation in a nucleic acid molecule encoding the platelet VDCC α₁ subunit polypeptide of claim 1.
 31. The method of claim 30, further comprising: (a) amplifying nucleic acid molecules in said sample using a nucleic acid amplification method and primers that selectively amplify said nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide; and (b) identifying whether a mutation is present in said amplified nucleic acid molecule.
 32. The method of claim 31, further comprising the step of analyzing a protein sample for the presence of a mutation in a platelet VDCC α₁ subunit polypeptide.
 33. A method for detecting a polymorphism in a nucleic acid molecule that encodes a platelet VDCC α₁ subunit polypeptide, the method comprising: (a) amplifying nucleic acid molecules in said sample using a nucleic acid amplification method and primers that selectively amplify said nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide; and (b) identifying whether a polymorphism is present in said amplified nucleic acid molecule.
 34. A kit for detecting a polymorphism in a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide, the kit comprising: (a) a reagent for detecting a polymorphism in a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide in a biological sample; and (b) a container for the reagent.
 35. The kit of claim 34, further comprising a reagent for amplifying a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide.
 36. The kit of claim 35, wherein the amplifying reagent comprises a polymerase enzyme suitable for use in a polymerase chain reaction and a pair of oligonucleotides.
 37. The kit of claim 35, further comprising a reagent for extracting a nucleic acid sample from a biological sample obtained from a subject.
 38. A method of screening candidate substances for an ability to modulate platelet VDCC α₁ subunit biological activity, the method comprising: (a) establishing a test sample comprising a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide; (b) administering a candidate substance to the test sample; and (c) measuring the interaction, effect, or combination thereof, of the candidate substance on the test sample to thereby determine the ability of the candidate substance to modulate platelet VDCC α₁ subunit biological activity.
 39. The method of claim 38, wherein the candidate substance is a candidate protein, a peptide, an antibody, a nucleic acid, or a chemical compound.
 40. The method of claim 39, further comprising the step of purifying and isolating a gene encoding the candidate polypeptide.
 41. The method of claim 39, wherein the platelet VDCC α₁ subunit polypeptide is contained within cells in cell culture.
 42. A recombinant cell line suitable for use in the method of claim 41.
 43. The method of claim 38, further comprising a modulatable transcriptional regulatory sequence of a platelet VDCC α₁ subunit polypeptide-encoding gene and a reporter gene which is capable of producing a detectable signal, wherein a candidate substance as a modulator of platelet VDCC α₁ subunit biological activity is based on the amount of signal produced in relation to a control sample.
 44. The method of claim 43, wherein the reporter gene encodes the platelet VDCC α₁ subunit polypeptide of claim 1.
 45. A method of modulating platelet VDCC α₁ subunit polypeptide biological activity in a cell, the method comprising administering to the cell an effective amount of a substance capable of modulating activity of a platelet VDCC α₁ subunit polypeptide in the cell to thereby modulate platelet VDCC α₁ subunit polypeptide biological activity in the cell.
 46. The method of claim 45, wherein the cell is a platelet or a megakaryocyte.
 47. The method of claim 46, wherein the cell comprises a cell in a vertebrate subject.
 48. The method of claim 47, wherein the vertebrate subject is a mammal.
 49. The method of claim 45, wherein the step of administering further comprises administering an effective amount of a substance that modulates expression of a platelet VDCC α₁ subunit polypeptide-encoding nucleic acid molecule in the cell.
 50. The method of claim 45, wherein the substance that modulates the platelet VDCC α₁ subunit biological activity comprises an anti-platelet VDCC α₁ subunit polypeptide antibody, a polypeptide, a peptide, a chemical compound, or a nucleic acid.
 51. The method of claim 50, wherein the nucleic acid substance that modulates expression of a platelet VDCC α₁ subunit polypeptide-encoding nucleic acid molecule comprises an antisense oligonucleotide.
 52. The method of claim 50, wherein the polypeptide, peptide, or chemical compound substance that modulates expression of the platelet VDCC α₁ subunit polypeptide-encoding nucleic acid molecule comprises a ligand for a modulatable transcriptional regulatory sequence of a platelet VDCC α₁ subunit polypeptide-encoding nucleic acid molecule.
 53. A pharmaceutical composition comprising a therapeutically effective amount of a modulator of a biological activity of a platelet VDCC α₁ subunit polypeptide, and combinations thereof, and a pharmaceutically acceptable diluent or vehicle.
 54. The pharmaceutical composition of claim 53, wherein the platelet VDCC α₁ subunit polypeptide-biological-activity-modulator preferentially binds a platelet VDCC α₁ subunit polypeptide, or a fragment or derivative thereof.
 55. A method for modulating calcium transport in a cell, the method comprising introducing to the cell a construct comprising a nucleic acid sequence encoding a platelet VDCC α₁ subunit polypeptide operatively linked to a promoter, wherein production of the platelet VDCC α₁ subunit polypeptide in the cell results in modulation of calcium transport.
 56. The method of claim 55, wherein the construct further comprises a vector selected from the group consisting of a plasmid vector or a viral vector.
 57. The method of claim 55, wherein the construct further comprises a liposome complex.
 58. The method of claim 55, wherein the cell is a platelet or a megakaryocyte.
 59. The method of claim 55, wherein the cell comprises a cell in a vertebrate subject.
 60. The method of claim 59, wherein the vertebrate subject is a mammal.
 61. A transgenic non-human animal having incorporated into its genome a nucleic acid molecule encoding a human platelet VDCC α₁ subunit polypeptide, wherein the human platelet VDCC α₁ subunit polypeptide is expressed in the transgenic non-human animal.
 62. A transgenic non-human animal having modified or deleted from its genome a nucleic acid molecule encoding a platelet VDCC α₁ subunit polypeptide. 
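
By way of illustration only, and without limiting any claim, the sequence comparison recited in claim 12 can be sketched as a simple string test: a candidate nucleotide sequence is checked for any run of 20 contiguous nucleotides that is identical to a run in SEQ ID NO:28 or SEQ ID NO:29. The two reference strings are copied from the sequence listing; the function names `kmers` and `shares_contiguous_run` are hypothetical helpers introduced for this sketch, which compares only the strand as written and does not consider reverse complements.

```python
# Illustrative sketch only; helper names are assumptions, not part of the disclosure.
# Reference sequences copied from the sequence listing (35 nt each).
SEQ_ID_NO_28 = "ggccctcagcgaagcggacaactctgaagagagca"
SEQ_ID_NO_29 = "atcctcagtgagatcgacgacccagatgagagtgc"


def kmers(seq, k=20):
    """Return the set of all contiguous k-nucleotide windows in seq (case-insensitive)."""
    seq = seq.lower()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shares_contiguous_run(candidate, reference, k=20):
    """True if candidate contains at least one k-nt run identical to a run in reference."""
    return bool(kmers(candidate, k) & kmers(reference, k))


if __name__ == "__main__":
    # A 25-nt fragment taken from SEQ ID NO:28 contains several identical 20-nt runs.
    fragment = SEQ_ID_NO_28[5:30]
    print(shares_contiguous_run(fragment, SEQ_ID_NO_28))
    print(shares_contiguous_run(fragment, SEQ_ID_NO_29))
```

A set of 20-nucleotide windows is used so that the comparison is a single set intersection; a candidate shorter than 20 nucleotides yields an empty set and therefore never matches.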